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TESTIS SPECIFIC GLYCOPROTEIN ZPEP10 
REFERENCE TO RELATED APPLICATIONS 

This application is a divisional of application Ser. No. 
09/441346 filed Nov. 16, 1999 now issued as U.S. Pat. No. 
6,242,588 which is related to Provisional Application No. 
60/109,216, filed on Nov. 20, 1998. Under 35 U.S.C. § 
119(eXl), this application claims benefit of said Provisional 
Application. 

BACKGROUND OF THE INVENTION 

The testis is the center for spermatogenesis, the process by 
which a germ cell proceeds through multiple stages of 
differentiation, and culminates in the formation of a termi- 
nally differentiated cell (spermatozoa or sperm) having a 
unique function. Within the testis are seminiferous tubules, 
where spermatogonium mature into spermatozoa. Surround- 
ing the seminiferous tubules are interstitial cells which 
secrete androgens, such as testosterone, required for matu- 
ration and function of the testis and development of sec- 
ondary sexual characteristics. Disorders of the testis are 
common and have profound effect. Infertility can result from 
disorders occurring during spermatogenesis. Many develop- 
mental disorders, such as hypogonadism, are associated with 
altered sex hormone production and levels in the testis. 
Testicular cancer, although rare, is the most common form of 
cancer in young men between the ages of 15 and 35. 

Testis specific proteins have therapeutic value in the 
treatment of disorders associated with the testis such as 
dysfunctional sperm production, infertility and testicular 
cancer. Towards this end, the present invention provides 
novel testis- specific membrane glycoproteins, soluble 
ligands, agonists and antagonists, related compositions and 
methods as well as other uses that should be apparent to 
those skilled in the art from the teachings herein. 

BRIEF DESCRIPTION OF THE DRAWING 

FIGS. 1A-K are a Hopp/Woods hydrophilicity profile of 
the amino acid sequence shown in SEQ ID NO:2. The profile 
is based on a sliding six-residue wiodow. Buried G, S, and 
T residues and exposed II, Y, and W residues were ignored. 
These residues are indicated in the FIG. by lower case 
letters. 

DETAILED DESCRIPTION OF THE 
INVENTION 

Prior to setting forth the invention in detail, it may be 
helpful to the understanding thereof to define the following 
terms: 

The term "affinity tag" is used herein to denote a peptide 
segment that can be attached to a polypeptide to provide for 
purification or detection of the polypeptide or provide sites 
for attachment of the polypeptide to a substrate. In principal, 
any peptide or protein for which an antibody or other 
specific binding agent is available can be used as an affinity 
tag. Affinity tags include a poly-histidine tract, protein A 
(Nilsson et al., EMBO J. 4:1075, 1985; Nilsson et al., 
Methods Enzymol. 198:3, 1991), glutathione S transferase 
(Smith and Johnson, Gene 67:31, 1988), Glu— Glu affinity 
tag (Grussenmeyer et al., Proc. Natl. Acad. Set. USA 
82:7952-4, 1985), substance P, Hag™ peptide (Hopp et al., 
Biotechnology 6:1204-10, 1988; available from Eastman 
Kodak Co., New Haven, Conn.), streptavidin binding 
peptide, or other antigenic epitope or binding domain. See, 
in general Ford el al., Protein Expression and Purification 2: 
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95-107, 1991. DNAs encoding affinity tags are available 
from commercial suppliers (e.g., Pharmacia Biotech, 
Piscataway, NJ.). 
The term "allelic variant" denotes any of two or more 

5 alternative forms of a gene occupying the same chromo- 
somal locus. Allelic variation arises naturally through 
mutation, and may result in phenotypic polymorphism 
within populations. Gene mutations can be silent (no change 
in the encoded polypeptide) or may encode polypeptides 

10 having altered amino acid sequence. The term allelic variant 
is also used herein to denote a protein encoded by an allelic 
variant of a gene. 

The terms "amino -terminal" and "carboxyl -terminal" are 
used herein to denote positions within polypeptides and 

j5 proteins. Where the context allows, these terms are used 
with reference to a particular sequence or portion of a 
polypeptide or protein to denote proximity or relative posi- 
tion. For example, a certain sequence positioned carboxyl- 
terminal to a reference sequence within a protein is located 

2Q proximal to the carboxyl terminus of the reference sequence, 
but is not necessarily at the carboxyl terminus of the 
complete protein. 

The term "complements of polynucleotide molecules" 
denotes polynucleotide molecules having a complementary 

25 base sequence and reverse orientation as compared to a 
reference sequence. For example, the sequence 5' ATG- 
CACGGG 3' is complementary to 5' CCCGTGCAT 3'. 

The term "contig" denotes a polynucleotide that has a 
contiguous stretch of identical or complementary sequence 

3Q to another polynucleotide. Contiguous sequences are said to 
"overlap" a given stretch of polynucleotide sequence either 
in their entirety or along a partial stretch of the polynucle- 
otide. For example, representative contigs to the polynucle- 
otide sequence 5 , -ATGGCTTAGCTT-3 , (SEQ ID NO: 12) 

35 are S'-TAGCTTgagtct-S' (SEQ ID NO: 13) and 
3'-gtcgacTACCGA-5» (SEQ ID NO: 14). 

The term "degenerate nucleotide sequence" denotes a 
sequence of nucleotides that includes one or more degener- 
ate codons (as compared to a reference polynucleotide 

40 molecule that encodes a polypeptide) Degenerate codons 
contain different triplets of nucleotides, but encode the same 
amino acid residue (i.e., GAU and GAC triplets each encode 
Asp). 

The term "expression vector" denotes a DNA molecule, 

45 linear or circular, that comprises a segment encoding a 
polypeptide of interest operably linked to additional seg- 
ments that provide for its transcription. Such additional 
segments may include promoter and terminator sequences, 
and may optionally include one or more origins of 

50 replication, one or more selectable markers, an enhancer, a 
polyadenylation signal, and the like. Expression vectors are 
generally derived from plasmid or viral DNA or may 
contain elements of both. 
The term "isolated", when applied to a polynucleotide, 

55 denotes that the polynucleotide has been removed from its 
natural genetic milieu and is thus free of other extraneous or 
unwanted coding sequences, and is in a form suitable for use 
within genetically engineered protein production systems. 
Such isolated molecules are those that are separated from 

60 their natural environment and include cDNA and genomic 
clones. Isolated DNA molecules of the present invention are 
free of other genes with which they are ordinarily associated, 
but may include naturally occurring 5' and 3' untranslated 
regions such as promoters and terminators. The identifica- 

65 tion of associated regions will be evident to one of ordinary 
skill in the art (see for example, Dynan and Tijan, Nature 
316:774-78, 1985). 
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An "isolated" polypeptide or protein is a polypeptide or sequences are commonly, but not always, found in the 5' 

protein that is found in a condition other than its native non-coding regions of genes. 

environment, such as apart from blood and animal tissue. In A "protein" is a macromolecule comprising one or more 

a preferred form, the isolated polypeptide is substantially polypeptide chains. A protein may also comprise non- 
free of other polypeptides, particularly other polypeptides of 5 peptidic components, such as carbohydrate groups. Carbo- 

animal origin. It is preferred to provide the polypeptides in hydrates and other non-peptidic substituents may be added 

a highly purified form, i.e. greater than 95% pure, more to a protein by the cell in which the protein is produced, and 

preferably greater than 99% pure. When used in this context, ^ varv w* 10 the tv P e of ^ Proteins are defined herein in 

the term "isolated" does not exclude the presence of the terms of their acid backbone structures; substituents 

same polypeptide in alternative physical forms, such as to such as carbohydrate groups are generally not specified, but 

dimers or alternatively glycosylated or derivatized forms. ma y P rcsenl nonelheIess - 

■-pi . u „ .ui„ ,• . c - . r^KTA The term "receptor" denotes a cell-associated protein that 

The term "operably hnked , when referring to DNA , . , 4 ,. J . . .. « * .. . 

. , , . , i binds to a bioactive molecule (i.e., a hcand) and mediates 

segments, denotes that the segments are arranged so that . _ , , v „ * ' , , 

A • , t X. • • . j j the effect of the ugand on the cell. Membrane-bound recep- 

they function in concert for their intended purposes, e.g. , ... , • . • • 

, •.. .v. . j ~j t °, 1<; tors are characterized by a multi-domain structure compns- 

transcnption mitiates in the promoter and proceeds through I s . „ , .. 7 , , , y , 

M M . _ ,u„ mg an extracellular hgand-binding domain and an mtracel- 

the coding segment to the terminator. . f _ t , -.,_.-.-„. , , - , 

hilar effector domain that is typically involved in signal 

The term "ortholog" denotes a polypeptide or protein transduction. Binding of ligand to receptor results in a 

obtained from one species that is the functional counterpart conformational change in the receptor that causes an inter- 

of a polypeptide or protein from a different species. action between the effector domain and other molecule(s) in 

Sequence differences among orthologs are the result of the cell. This mteraction in turn leads to an alteration in the 

speciation. ^ metabolism of the cell. Metabolic events that are linked to 

The term "polynucleotide" denotes a single- or double- receptor-ligand interactions include gene transcription, 

stranded polymer of deoxyribonucleotide or ribonucleotide phosphorylation, dephosphorylation, increases in cyclic 

bases read from the 5' to the 3' end. Polynucleotides include 25 AMP production, mobilization of cellular calcium, mobili- 

RNA and DNA, and may be isolated from natural sources, zat i 0 n of membrane lipids, cell adhesion, hydrolysis of 

synthesized in vitro, or prepared from a combination of inositol lipids and hydrolysis of phospholipids. Most nuclear 

natural and synthetic molecules. Sizes of polynucleotides receptors also exhibit a multi-domain structure, including an 

are expressed as base pairs (abbreviated "bp"), nucleotides amino-terminal, transactivating domain, a DNA binding 

("nt"), or kilobases ("kb"). Where the context allows, the 3Q domain and a ligand binding domain. In general, receptors 

latter two terms may describe polynucleotides that are ca n be membrane bound, cytosolic or nuclear; monomeric 

single-stranded or double-stranded. When the term is (e.g., thyroid stimulating hormone receptor, beta-adrenergic 

applied to double-stranded molecules it is used to denote receptor) or multimeric (e.g., PDGF receptor, growth hor- 

overall length and will be understood to be equivalent to the m one receptor, IL-3 receptor, GM-CSF receptor, G-CSF 

term "base pairs". It will be recognized by those skilled in 35 receptor, erythropoietin receptor and IL-6 receptor), 

the art that the two strands of a double-stranded polynucle- ^ term -secretory signal sequence" denotes a DNA 

otide may dtfer slightly in length and that the ends thereof sequence that encodes a polypeptide (a "secretory peptide") 

may be staggered as a result of enzymatic cleavage; thus all that ^ ^ a component Q f a largcr polypeptide, directs the 

nucleotides within a double-stranded polynucleotide mol- larger polypeptide through a secretory pathway of a cell in 

ecule may not be paired. Such unpaired ends will in general ^ which it ^ S y nt hesized. The larger peptide is commonly 

not exceed 20 nl in length. cleaved to remove the secretory peptide during transit 

A "polypeptide" is a polymer of amino acid residues through the secretory pathway, 

joined by peptide bonds, whether produced naturally or Th c term "soluble receptor" is used herein to refer to a 

synthetically. Polypeptides of less than about 10 amino acid receptor polypeptide that is not bound to a cell membrane, 

residues are commonly referred to as "peptides". 45 Soluble receptors are most commonly ligand-binding recep- 

"Probes and/or primers" as used herein can be RNA or tor polypeptides that lack transmembrane and cytoplasmic 

DNA. DNA can be either cDNA or genomic DNA. Poly- domains. Soluble receptors can comprise additional amino 

nucleotide probes and primers are single or double-stranded acid residues, such as affinity tags that provide for purifica- 

DNA or RNA, generally synthetic oligonucleotides, but may tion of the polypeptide or provide sites for attachment of the 

be generated from cloned cDNAor genomic sequences or its 50 polypeptide to a substrate. Many cell-surface receptors have 

complements. Analytical probes will generally be at least 20 naturally occurring, soluble counterparts that are produced 

nucleotides in length, although somewhat shorter probes by proteolysis or translated from alternatively spliced 

(14-17 nucleotides) can be used. PCR primers are at least 5 mRNAs. Receptor polypeptides are said to be substantially 

nucleotides in length, preferably 15 or more nt, more pref- free of transmembrane and intracellular polypeptide seg- 

erably 20-30 nt. Short polynucleotides can be used when a 55 ments when they lack sufficient portions of these segments 

small region of the gene is targeted for analysis. For gross to provide membrane anchoring or signal transduction, 

analysis of genes, a polynucleotide probe may comprise an respectively. 

entire exon or more. Probes can be labeled to provide a The term "splice variant" is used herein to denote alter- 

detectable signal, such as with an enzyme, biotin, a native forms of RNA transcribed from a gene. Splice varia- 

radionuclide, fluorophore, chemiluminescer, paramagnetic 60 tion arises naturally through use of alternative splicing sites 

particle and the like, which are commercially available from within a transcribed RNA molecule, or less commonly 

many sources, such as Molecular Probes, Inc., Eugene, between separately transcribed RNA molecules, and may 

Greg., and Amersham Corp., Arlington Heights, HI., using result in several mRNAs transcribed from the same gene, 

techniques that are well known in the art. Splice variants may encode polypeptides having altered 

The term "promoter" denotes a portion of a gene con- 65 amino acid sequence. The term splice variant is also used 

taining DNA sequences that provide for the binding of RNA herein to denote a protein encoded by a splice variant of an 

polymerase and initiation of transcription. Promoter mRNA transcribed from a gene. 
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Molecular weights and lengths of polymers determined by joined by a peptide bond, the first portion comprising a 

imprecise analytical methods (e.g., gel electrophoresis) will polypeptide as described herein, and the second portion 

be understood to be approximate values. When such a value comprising another polypeptide. 

is expressed as "about" X or "approximately" X, the stated The invention also provides a polypeptide as described 

value of X will be understood to be accurate to ±10%. 5 herein in combination with a pharmaceutical^ acceptable 

All references cited herein are incorporated by reference Ve «r!L- „l . .u • a a 

. *^ 3 Within another aspect the invention provides an antibody 

in tneir entirety. thal specin cally binds to an ep j 10 pe of a polypeptide of as 

Within one aspect the invention provides an isolated described herein. Within one embodiment the antibody is 

polypeptide comprising an extracellular domain, wherein selected from the group consisting of: a) polyclonal anti- 

the extracellular domain comprises amino acid residues 22 10 body; b) murine monoclonal antibody; c) humanized anti- 

to 111 of the amino acid sequence of SEQ ID NO:2. Within body derived from b); and d) human monoclonal antibody, 

one embodiment polypeptide further comprises a transmem- Within another embodiment the antibody fragment is 

brane domain that resides in a carboxyl-terminal position selected from the group consisting of F(ab'), F(ab), Fab', 

relative to the extracellular domain, wherein the transmem- Fab, Fv, scFv, and minimal recognition unit, 

brane domain comprises amino acid residues 112 to 133 of 15 Also provides is an anti-idiotype antibody that specifically 

the amino acid sequence of SEO ID NO:2. Within another binds t0 an antibody as described herein. Also provided is a 

embodiment the polypeptide further comprises a cytoplas- binding protein that specificaUy binds to an epitope of a 

mic domain that resides in a carboxyl-terminal position polypeptide as described herein. 

relative to the transmembrane domain, wherein the cyto- W,lnm mother aspect the invention provides a method of 

plasmic domain comprises amino acid residues 134 to 142 20 Inducing an antibody to a polypeptide comprising: inocu- 

of the amino acid sequence of SEQ ID NO:2. Within another latmg an animal with a polypeptide as described herein; 

embodiment the polypeptide further comprises a secretory where ; n the V^yptpUdt elicits an immune response in the 

signal that resides in an amino-terminal position relative to amma ' 10 P mdu ™ the anllbod y» and *° lalm S tne anlibod y 

the extracellular domain, wherein the secretory signal ammaL 

sequence comprises amino acid residues 1 to 20 of the amino 25 . Wlthm ™ oihcT 15 P r0Vlded plated polynuc e- 

acid sequence of SEQ ID NO:2. otlde cncodlQ g a polypeptide comprising an extracellular 

^ , . , .... . . domain, wherein the extracellular domain comprises amino 

The invention also provides an isolated polypeptide as add residues 22 tQ m of ^ amino ^ ce of SEQ 

described herein comprising amino acid residue 1 to ammo , D N0;2 ^ one embodimeQt the polypi 

acid residue 142 of SEQ ID NO: 2. 3Q comprises a transmemb rane domain that resides in a 

Also provided is an isolated polypeptide as described carboxyl-terminal position relative to the extracellular 

herein, covalently linked amino terminally or carboxy ter- domain, wherein the transmembrane domain comprises 

minally to a moiety selected from the group consisting of amino ac jd residues 112 to 133 of the amino acid sequence 

affinity lags, toxins, radionuclides, enzymes and fluoro- c f SEQ ID NO.2. Within another embodiment the polypep- 

phores. 3S tide further comprises a cytoplasmic domain that resides in 

Within another aspect the invention provides an isolated a carboxyl-terminal position relative to the transmembrane 

polypeptide comprising a sequence of amino acid residues domain, wherein the cytoplasmic domain comprises amino 

that is at least 80% identical to a amino acid residue 21 to acid residues 134 to 142 of the amino acid sequence of SEQ 

amino acid residue 142 of SEQ ID NO:2, wherein the ID NO:2. Within yet another embodiment the polypeptide 

polypeptide specifically binds with an antibody that specifi- 40 further comprises a secretory signal that resides in an 

cally binds with a polypeptide having the amino acid amino-terminal position relative to the extracellular domain, 

sequence of SEQ ID NO:2. Within one embodiment any wherein the secretory signal sequence comprises amino acid 

difference between the amino acid sequence of the isolated residues 1 to 20 of the amino acid sequence of SEQ ID 

polypeptide and the corresponding amino acid sequence of NO:2. 

SEQ ID NO:2 is due to a conservative amino acid subsli- 45 The invention also provides an isolated polynucleotide as 

tution. Within another embodiment the amino acid percent described herein encoding a polypeptide comprising amino 

identity is determined using a FASTA program with ktup-1, acid residue 1 to amino acid residue 142 of SEQ ID NO:2. 

gap opening penalty«10, gap extension penalty=l, and sub- Also provided is an isolated polynucleotide as described 

stitution matrix=blosum62, with other parameters set as herein, wherein the polypeptide is covalently linked amino 

default. 50 terminally or carboxy terminally to a moiety selected from 

The invention provides an isolated polypeptide compris- tae group consisting of affinity lags, toxins, 

ing the amino acid sequence of amino acid residue 1 to radionucleolides, enzymes and fluorophores. 

amino acid residue 20 of SEQ ID NO:2. Within another aspect the invention provides an isolated 

Also provided is an isolated polypeptide selected from the Polynucleotide encoding a polypeptide comprising a 

group consisting of: 55 sequence of amino acid residues that is at least 80% identical 

. . . '. . , , , _ .„ _ to a amino acid residue 21 to amino acid residue 142 of SEQ 

a) amino acid residues 21-111 of SEQ ID NO:2; ID NO:2, wherein the polypeptide specifically binds with an 

b) ammo acid residues 112-133 of SEQ ID NO:2; antibody that specifically binds with a polypeptide having 

c) amino acid residues 134-142 of SEQ ID NO:2; the amino acid sequence of SEQ ID NO:2. Within one 

d) amino acid residues 1-20 of SEQ ID NO:2; 60 embodiment any difference between the amino acid 

e) amino acid residues 21-133 of SEQ ID NO:2; sequence of the isolated polypeptide and the corresponding 
- . , , __ _ . „ ammo acid sequence of SEQ ID NO:2 is due to a co riser- 

f) ammo acid residues 112-142 of SEQ ID NO:2; vative amin0 add substimtion . wimiD embodimerj t 

g) ammo acid residues 1-111 of SEQ ID NO:2; and ^ c armno ac j d percent identity is determined using a FASTA 

h) amino acid residues 1-133 of SEQ ID NO:2. 65 program with ktup-1, gap opening penalty- 10, gap exten- 
Within another aspect the invention provides a fusion sion penalty-1, and substitution matrix-blosum62, with 

protein consisting of a first portion and a second portion other parameters set as default. 



US 6,498,235 B2 

7 8 

The invention also provides an isolated polynucleotide message. A minimal nucleotide sequence having all poten- 

selected from the group consisting of: tial introns spliced out was generated. The full length cDNA 

a) a sequence of nucleotides from nucleotide 139 to sequence was identified from a testis library and is disclosed 
nucleotide 411 of SEQ ID NO:l; * SE Q 10 N0:1 - ^ deduced amino acid sequence of this 

b) a sequence of nucleotide 139 to > 

nucleoude 477 of SEQ ID NO:l; , D ^ revealed m Qpen ? e admg frame encoding 142 

c) a sequence of nucleotides from nucleotide 139 to amino acids ( S EQ j D N0: 2) comprising a putative signal 
nucleotide 504 of SEQ ID NO:l; sequence (residues 1 to 20 of SEQ ID NO:2, nucleotides 79 

d) a sequence of nucleotides from nucleotide 79 to Q to 138 of SEQ ID NO: 1) and 122 amino acids of predicted 
nucleotide 504 of SEQ ID NO:l; mature sequence (residues 21 to 142 of SEQ ID NO:2, 

e) a sequence of nucleotides from nucleotide 1 to nucle- nucleotides 139 to 504 of SEQ ID NO:l) containing an 
otide 1094 of SEQ ID NO 1- extracellular domain (residues 21 to 111 of SEQ ID NO:2, 

A , , ... t ' \ . . ... . r „ nucleotides 139 to 411 of SEQ ID NO:l) containing six 

f) a polynucleotide that remains hybridized following ^ ^ ^ add residues 35 £ g4 g? £^ 

stringent wash conditions to a polynucleotide consist- 15 100 of SEQ 1D N0:2> , ^basic amino acid cleavage site, 

mg of the nucleotide sequence of SEQ ID NO:l, or the amino add residues 97 _ 99 of SEQ ID NQ:2; potemial 

complement of SEQ ID NO:l; and N-linked glycosylation sites at amino acid residues 83 and 

g) nucleotide sequences complementary to a), b), c), d), 86 of SEQ ID NO:2; and potential O-glycosylation sites at 
e), or f . amino acid residues 28, 36, 48, 52, 60, 65, 68, 78, 79, 80, 85, 

Further provided is an isolated polynucleotide encoding a 20 86, 90, 93 and 104 of SEQ ID NO: 2; a putative transmem- 

fusion protein consisting of a first portion and a second brane domain (residues 112 to 133 of SEQ ID NO:2, 

portion joined by a peptide bond, the first portion comprises nucleotides 412 to 477 of SEQ ID NO:l) and a cytoplasmic 

a polypeptide as described herein; and the second portion domain (residues 134 to 142 of SEQ ID NO:2, nucleotides 

comprising another polypeptide. 478 to 504 of SEQ ID NO:l). The overall structure of 

Also provided is an isolated polynucleotide encoding a 25 zpeplO is helical. Those skilled in the art will recognize that 

fusion protein comprising a secretory signal sequence hav- these domain boundaries are approximate, and are based on 

ing the amino acid sequence of amino acid residues 1-20 of alignments with known proteins and predictions of protein 

SEQ ID NO:2, wherein the secretory signal sequence is folding. ZpeplO does not share significant homology with 

operably linked to an additional polypeptide. any known protein. 

The invention also provides an isolated polynucleotide 30 Many proteins and hormones are processed into their 

comprising the sequence of nucleotide 1 to nucleotide 426 of mature forms by highly-specific proteolytic enzymes, pro- 

SEQ ID NO:3. hormone convertases, which carry out intracellular cleavage 

Within another aspect is provided an expression vector at the COOH-terminal side of dibasic sites within their 

comprising the following operably linked elements: substrate polypeptides. There are only a few dibasic amino 

a transcription promoter; a DNA segment encoding a 35 acid combinations, including lys-lys, arg-arg, arg-lys and 

polypeptide as described herein; and a transcription termi- lys-arg. ZpeplO polypeptides may be processed into an 

nator. active form through cleavage after lys (amino acid residue 

Within one embodiment the DNA segment encodes a 98 of SEQ ID NO:2) or arg (amino acid residue 99 of SEQ 

polypeptide covalently linked amino terminally or carboxy ID NO:2) of the tribasic site arg-lys-arg (amino acid residues 

terminally to an affinity tag. Within another embodiment the 40 97-99 of SEQ ID NO:2). Prohormone convertase PC4 

DNA segment further encodes a secretory signal sequence exhibits highly specific testis expression (WIPO publication, 

operably linked to the polypeptide. Within yet another WO98/50560) and may serve to cleave the zpeplO polypep- 

embodiment the secretory signal sequence comprises resi- tide. 

dues 1 to 20 of SEQ ID NO: 2. The present invention therefore provides post- 
Trie invention also provides a cultured cell into which has 45 translationally modified polypeptides or polypeptide frag- 
been introduced an expression vector as described herein; ments having the amino acid sequence from amino acid 
wherein the cell expresses the polypeptide encoded by the residue 21 to amino acid residue 98 of SEQ ID NO:2 and the 
DNA segment. amino acid sequence from amino acid residue 21 to amino 
The invention also provides a method of producing a acid residue 99 of SEQ ID NO:2. Examples of post trans- 
polypeptide comprising: culturing a cell into which has been so lational modifications include proteolytic cleavage, glyco- 
introduced an expression vector as described herein; sylation and disulfide bonding. 

whereby the cell expresses the polypeptide encoded by the Analysis of the tissue distribution of the mRNA corre- 
DNA segment; and recovering the expressed polypeptide. sponding to this novel DNA by Northern blot and Dot blot 
The present invention is based in part upon the discovery analysis suggest that zpeplO is a testis-specific protein 
of a novel DNA sequence (SEQ ID NO:l) and the corre- 55 having a transcript of about 1.5 kb. 
sponding deduced polypeptide sequence (SEQ ID NO:2) The present invention further provides polynucleotide 
which encode a testis-specific polypeptide designated molecules, including DNA and RNA molecules, encoding 
zpeplO. The novel zpeplO polypeptide-encoding polynucle- zpeplO proteins. The polynucleotides of the present inven- 
otides of the present invention were initially identified by tion include the sense strand; the anti-sense strand; and the 
querying an EST database for polypeptides containing 60 DNA as double-stranded, having both the sense and anti- 
repetitive patterns and post-translational processing sites sense strand annealed together by their respective hydrogen 
yielding potentially active peptides. The polypeptide corre- bonds. Representative DNA sequences encoding zpeplO 
sponding to an EST meeting those search criteria was further proteins are set forth in SEQ ID NO:l. DNA sequences 
analyzed and found to be a membrane glycoprotein. The encoding other zpeplO proteins can be readily generated by 
EST sequence was from a testis cell library. Several clones 65 those of ordinary skill in the art based on the genetic code, 
considered likely to contain the entire coding region were Counterpart RNA sequences can be generated by substitu- 
used for sequencing and resulted in an incompletely spliced tion of U for T. 
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Those skilled in the art will readily recognize that, in view 
of the degeneracy of the genetic code, considerable sequence 
variation is possible among these polynucleotide molecules. 
SEQ ID NO:3 is a degenerate DNA sequence that encom- 
passes all DNAs that encode the zpeplOpolypeptide of SEQ 
ID NO: 2. Those skilled in the art will recognize that the 
degenerate sequence of SEQ ID NO:3 also provides all RNA 
sequences encoding SEQ ID NO:2 by substituting U for T. 
Thus, zpeplO polypeptide-encoding polynucleotides com- 
prising nucleotide 1 to nucleotide 426 of SEQ ID NO:3 and 
their RNA equivalents are contemplated by the present 
invention. Table 1 sets forth the one-letter codes used within 
SEQ ID NO:3 to denote degenerate nucleotide positions. 
"Resolutions" are the nucleotides denoted by a code letter. 
"Complement" indicates the code for the complementary 
nucleotide(s). For example, the code Y denotes either C or 
T, and its complement R denotes A or G, A being comple- 
mentary to T, and G being complementary to C. 

TABLE 1 



Nucleotide 


Resolution 


Complement 


Nucleotide 


A 


A 


T 


T 


C 


C 


G 


G 


G 


G 


c 


C 


T 


T 


A 


A 


H 


A|G 


Y 


C|T 


Y 


c|t 


R 


A|G 


M 


A|C 


K 


G|T 


X 


G | T 


M 


A|C 


S 


C G 


S 


C|G 


w 


a|t 


W 


a|t 


H 


a|c|t 


D 


A|G|T 


B 


C|GjT 


V 


A|C|G 


V 


A|C|G 


B 


C|G|T 


D 


a|g|t 


H 


a|c|t 


N 


A |C |G| T 


N 


A|C|G|T 



The degenerate codons used in SEQ ID NO: 3, encom- 
passing all possible codons for a given amino acid, are set 
forth in Table 2. 



TABLE 2 



One 
Letter 
Code Codons 



Degenerate 
Codon 



Cys 


c 


TGC TGT 






TGY 


Ser 


s 


AGC AGT 


TCA 


TCC TCG TCT 


WSN 


Thr 


T 


ACA ACC 


ACG 


ACT 


ACN 


Pro 


P 


CCA CCC 


CCG 


CCT 


CCN 


Ala 


A 


GCA GCC 


GCG 


GCT 


GCN 


Gly 


G 


GGA GGC 


GGG 


GGT 


GGN 


Asn 


N 


AAC AAT 






AAY 


Asp 


D 


GAC GAT 






GAY 


Glu 


E 


GAA GAG 






GAR 


Gin 


Q 


CAA CAG 






CAR 


Hie 


H 


CAC CAT 






CAY 


Arg 


R 


AGA AGG 


CGA 


CGC CGG CGT 


MGN 


Lyo 


K 


AAA AAG 






AAR 


Met 


M 


ATG 






ATG 


He 


I 


ATA ATC 


ATT 




ATH 


Leu 


L 


CTA CTC 


CTG 


CTT TTA TTG 


YTN 


Val 


V 


GTA GTC 


GTG 


GTT 


GTN 


Phe 


F 


TTC TTT 






TTY 


Tyr 


Y 


TAC TAT 






TAY 


Trp 


W 


TGG 






TGG 


Ter 




TAA TAG 


TGA 




TRR 


Asn| Asp 


B 








RAY 


Glu j Gin 


Z 








SAR 


Any 


X 








NNN 



One of ordinary skill in the art will appreciate that some 
ambiguity is introduced in determining a degenerate codon, 



representative of all possible codons encoding each amino 
acid. For example, the degenerate codon for serine (WSN) 
can, in some circumstances, encode arginine (AGR), and the 
degenerate codon for arginine (MGN) can, in some 

5 circumstances, encode serine (AG Y). A similar relationship 
exists between codons encoding phenylalanine and leucine. 
Thus, some polynucleotides encompassed by the degenerate 
sequence may encode variant amino acid sequences, but one 
of ordinary skill in the art can easily identify such variant 

io sequences by reference to the amino acid sequence of SEQ 
ID NO:2. Variant sequences can be readily tested for func- 
tionality as described herein. 

One of ordinary skill in the art will also appreciate that 
different species can exhibit "preferential codon usage." In 

15 general, see, Grantham, et al., Nuc. Acids Res. 8:1893-912, 
1980; Haas,et al. Curr. Biol 6:315-24, 1996; Wain-Hobson, 
et al., Gene 13:355-64, 1981; Grosjean and Fiers, Gene 
18:199-209, 1982; Holm, Nuc. Acids Res. 14:3075-87, 
1986; Ikemura, J. Mol Biol. 158:573-97, 1982. As used 

20 herein, the term "preferential codon usage" or "preferential 
codons" is a term of art referring to protein translation 
codons that are most frequently used in cells of a certain 
species, thus favoring one or a few representatives of the 
possible codons encoding each amino acid (See Table 2). 

25 For example, the amino acid threonine (Thr) may be 
encoded by ACA, ACC, ACG, or ACT, but in mammalian 
cells ACC is the most commonly used codon; in other 
species, for example, insect cells, yeast, viruses or bacteria, 
different Thr codons may be preferential. Preferential 

30 codons for a particular species can be introduced into the 
polynucleotides of the present invention by a variety of 
methods known in the art. Introduction of preferential codon 
sequences into recombinant DNA can, for example, enhance 
production of the protein by making protein translation more 

35 efficient within a particular cell type or species. Therefore, 
the degenerate codon sequence disclosed in SEQ ID NO:3 
serves as a template for optimizing expression of polynucle- 
otides in various cell types and species commonly used in 
the art and disclosed herein. Sequences containing prefer- 

40 ential codons can be tested and optimized for expression in 
various species, and tested for functionality as disclosed 
herein. 

Within preferred embodiments of the invention the iso- 
lated polynucleotides will hybridize to similar sized regions 

45 of SEQ ID NO:l, other polynucleotide probes, primers, 
fragments and sequences recited herein or sequences 
complementary thereto. Polynucleotide hybridization is well 
known in the art and widely used for many applications, see 
for example, Sambrook et al., Molecular Cloning: A Labo- 

50 ratory Manual, Second Edition, Cold Spring Harbor, N.Y, 
1989; Ausubel et al., eds., Current Protocols in Molecular 
Biology, John Wiley and Sons, Inc., NY, 1987; Berger and 
Kimmel, eds., Guide to Molecular Cloning Techniques, 
Methods in Enzymology, volume 152, 1987 and Wetmur, 

55 Crit. Rev. Biochem. Mol Biol 26:227-59, 1990. Polynucle- 
otide hybridization exploits the ability of single stranded 
complementary sequences to form a double helix hybrid. 
Such hybrids include DNA-DNA, RNA-RNA and DNA- 
RNA 

60 Hybridization will occur between sequences which con- 
tain some degree of complementarity. Hybrids can tolerate 
mismatched base pairs in the double helix, but the stability 
of the hybrid is influenced by the degree of mismatch. The 
T m of the mismatched hybrid decreases by 1° C. for every 

65 1-1.5% base pair mismatch. Varying the stringency of the 
hybridization conditions allows control over the degree of 
mismatch that will be present in the hybrid. The degree of 
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stringency increases as the hybridization temperature 
increases and the ionic strength of the hybridization buffer 
decreases. Stringent hybridization conditions encompass 
temperatures of about 5-25° C. below the thermal melting 
point (T m ) of the hybrid and a hybridization buffer having up 
to 1 M Na*. Higher degrees of stringency at lower tempera- 
tures can be achieved with the addition of formamide which 
reduces the T m of the hybrid about 1° C. for each 1% 
formamide in the buffer solution. Generally, such stringent 
conditions include temperatures of 20-70° C. and a hybrid- 
ization buffer containing up to 6x SSC and 0-50% forma- 
mide. A higher degree of stringency can be achieved at 
temperatures of from 40-70° C. with a hybridization buffer 
having up to 4x SSC and from 0-50% formamide. Highly 
stringent conditions typically encompass temperatures of 
42-70° C. with a hybridization buffer having up to lx SSC 
and 0-50% formamide. Different degrees of stringency can 
be used during hybridization and washing to achieve maxi- 
mum specific binding to the target sequence. Typically, the 
washes following hybridization are performed at increasing 
degrees of stringency to remove non-hybridized polynucle- 
otide probes from hybridized complexes. 

The above conditions are meant to serve as a guide and it 
is well within the abilities of one skilled in the art to adapt 
these conditions for use with a particular polypeptide hybrid. 
The T m for a specific target sequence is the temperature 
(under defined conditions) at which 50% of the target 
sequence will hybridize to a perfectly matched probe 
sequence. Those conditions which influence the T m include, 
the size and base pair content of the polynucleotide probe, 
the ionic strength of the hybridization solution, and the 
presence of destabilizing agents in the hybridization solu- 
tion. Numerous equations for calculating T m are known in 
the art, see for example (Sambrook et al., ibid.; Ausubel et 
al., ibid.; Berger and Kimmel, ibid, and Wetmur, ibid.) and 
are specific for DNA, RNA and DNA-RNA hybrids and 
polynucleotide probe sequences of varying length. Sequence 
analysis software such as Oligo 4.0 (publicly available 
shareware) and Primer Premier (PREMIER Biosoft 
International, Palo Alto, Calif.) as well as sites on the 
Internet, are available tools for analyzing a given sequence 
and calculating T m based on user defined criteria. Such 
programs can also analyze a given sequence under defined 
conditions and suggest suitable probe sequences. Typically, 
hybridization of longer polynucleotide sequences, >50 bp, is 
done at temperatures of about 20-25° C. below the calcu- 
lated T m . For smaller probes, <50 bp, hybridization is 
typically carried out at the T m or 5-10° C. below. This allows 
for the maximum rate of hybridization for DNA-DNA and 
DNA-RNA hybrids. 

The length of the polynucleotide sequence influences the 
rate and stability of hybrid formation. Smaller probe 
sequences, <50 bp, come to equilibrium with complemen- 
tary sequences rapidly, but may form less stable hybrids. 
Incubation times of anywhere from minutes to hours can be 
used to achieve hybrid formation. Longer probe sequences 
come to equilibrium more slowly, but form more stable 
complexes even at lower temperatures. Incubations are 
allowed to proceed overnight or longer. Generally, incuba- 
tions arc carried out for a period equal to three times the 
calculated Cot time. Cot time, the time it takes for the 
polynucleotide sequences to reassociate, can be calculated 
for a particular sequence by methods known in the art. 

The base pair composition of polynucleotide sequence 
will effect the thermal stability of the hybrid complex, 
thereby influencing the choice of hybridization temperature 
and the ionic strength of the hybridization buffer. A-T pairs 
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are less stable than G-C pairs in aqueous solutions contain- 
ing NaCl. Therefor, the higher the G-C content, the more 
stable the hybrid. Even distribution of G and C residues 
within the sequence also contribute positively to hybrid 

5 stability. Base pair composition can be manipulated to alter 
the T m of a given sequence, for example, 
5-methyldeoxycytidine can be substituted for deoxycytmi- 
dine and 5-bromodeoxuride can be substituted for thymidine 
to increase the T m . 7-deazz-2'-deoxyguanosine can be sub- 

JQ stituted for guanosine to reduce dependence on T m . 

Ionic concentration of the hybridization buffer also effects 
the stability of the hybrid. Hybridization buffers generally 
contain blocking agents such as Denhardt's solution (Sigma 
Chemical Co., St. Louis, Mo.), denatured salmon sperm 
DNA, tRNA, milk powders (BLOTTO), heparin or SDS, 

15 and a Na + source, such as SSC (lx SSC: 0.15 M NaCl, 15 
mM sodium citrate) or SSPE (lx SSPE: 1.8 M NaCi, 10 mM 
NaH 2 P0 4 , 1 mM EDTA, pH 7.7). By decreasing the ionic 
concentration of the buffer, the stability of the hybrid is 
increased. Typically, hybridization buffers contain from 

20 between 10 mM-1 M Na + . Premixed hybridization solutions 
are also available from commercial sources such as Clontech 
Laboratories (Palo Alto, Calif.) and Promega Corporation 
(Madison, Wis.) for use according to manufacturer's instruc- 
tion. Addition of destabilizing or denaturing agents such as 

25 formamide, tetralkylammonium salts, guanidinium cations 
or thiocyanate canons to the hybridization solution will alter 
the T m of a hybrid. Typically, formamide is used at a 
concentration of up to 50% to allow incubations to be carried 
out at more convenient and lower temperatures. Formamide 

30 also acts to reduce non-specific background when using 
RNA probes. 

As previously noted, the isolated zpeplOpoly nucleotides 
of the present invention include DNA and RNA. Methods 
for isolating DNA and RNA are well known in the art. It is 

35 generally preferred to isolate RNA from lymph node, 
although DNA can also be prepared using RNA from other 
tissues or isolated as genomic DNA. Total RNA can be 
prepared using guanidine HC1 extraction followed by iso- 
lation by centrifugation in a CsCl gradient (Chirgwin et al., 

40 Biochemistry 18:52-94, 1979). Poly (A) + RNA is prepared 
from total RNA using the method of Aviv and Leder (Proc. 
Natl Acad. Set. USA 69:1408-12, 1972). Complementary 
DNA (cDNA) is prepared from poly(A)* RNA using known 
methods. Polynucleotides encoding zpeplOpolypeptides are 

45 then identified and isolated by, for example, hybridization or 
PCR. 

The polynucleotides of the present invention can also be 
synthesized using automated equipment. The current 
method of choice is the phosphoramidite method. If chemi- 

50 cally synthesized double stranded DNA is required for an 
application such as the synthesis of a gene or a gene 
fragment, then each complementary strand is made sepa- 
rately. The production of short genes (60 to 80 bp) is 
technically straightforward and can be accomplished by 

55 synthesizing the complementary strands and then annealing 
them. For the production of longer genes (>300 bp), 
however, special strategies must be invoked, because the 
coupling efficiency of each cycle during chemical DNA 
synthesis is seldom 100%. To overcome this problem, syn- 

60 thetic genes (double -stranded) are assembled in modular 
form from single-stranded fragments that are from 20 to 100 
nucleotides in length. Gene synthesis methods are well 
known in the art. See, for example, Glide and Pasternak, 
Molecular Biotechnology, Principles & Applications of 

65 Recombinant DNA, ASM Press, Washington, D.C., 1994; 
Itakura et iLAnnu. Rev. Biochem. 53: 323-356, 1984; and 
Climie et al., Proc. Natl Acad. Sci. USA 87:633-637, 1990. 
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The zpeplO polynucleotide sequences disclosed herein 
can be used to isolate polynucleotides encoding other 
zpeplO proteins. Such other proteins include alternatively 
spliced cDNAs (including cDNAs encoding secreted zpeplO 
proteins) and counterpart polynucleotides from other species 
(orthologs). These orthologous polynucleotides can be used, 
inter alia, to prepare the respective orthologous proteins. 
Other species of interest include, but are not limited to, 
mammalian, avian, amphibian, reptile, fish, insect and other 
vertebrate and invertebrate species. Of particular interest are 
zpeplO polynucleotides and proteins from other mammalian 
species, including human and other primates, porcine, ovine, 
bovine, canine, feline, and equine polynucleotides and pro- 
teins. Orthologs of mouse zpeplO, for example, can be 
cloned using information and compositions provided by the 
present invention in combination with conventional cloning 
techniques. For example, a cDNA can be cloned using 
mRNA obtained from a tissue or cell type that expresses 
zpeplO as disclosed herein. Suitable sources of mRNA can 
be identified by probing Northern blots with probes designed 
from the sequences disclosed herein. A library is then 
prepared from mRNA of a positive tissue or cell line. A 
zpeplO-encoding cDNA can then be Isolated by a variety of 
methods, such as by probing with a complete or partial 
human cDNAor with one or more sets of degenerate probes 
based on the disclosed sequences. A cDNA can also be 
cloned using the polymerase chain reaction, or PCR (Mullis, 
U.S. Pat. No. 4,683,202), using primers designed from the 
representative human zpeplO sequence disclosed herein. 
Within an additional method, the cDNA library can be used 
to transform or transfect host cells, and expression of the 
cDNA of interest can be detected with an antibody to zpeplO 
polypeptide. Similar techniques can also be applied to the 
isolation of genomic clones. Electronic databases can also 
be screened for EST sequences of zpeplO orthologs. Degen- 
erate polynucleotide primer sequences useful for identifying 
zpeplO orthologs would include: 



zpeplO residues 15-20 of SEQ ID NO: 2 

CARGCNTGYGTNTTYTG (SEQ ID N0:4) 
zpeplO residues 42-47 of SEQ ID MO: 2 

CARAARGARTGYGGNGC (SEQ ID NO:5) 
zpeplO residues 61-6 6 of SEQ ID NO: 2 

ATGAAYAARGRNACNGA (SEQ ID NO:6) 
zpeplO residues 64-69 of SEQ ID NO:2 

GRNACNGARAARA.CN CA (SEQ ID NO:7) 
zpeplO residues 8 6-91 of SEQ ID NO: 2 

ACNTGY AARGGN ACN GA (SEQ ID NO : 8 ) . 
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Those skilled in the art will recognize that the sequences 
disclosed in SEQ ID NO:l and SEQ ID NO:2 represent a 
single allele of the human zpeplO gene and polypeptide, and 
that allelic variation and alternative splicing are expected to 

5 occur. In addition, allelic variants can be cloned by probing 
cDNA or genomic libraries from different individuals 
according to standard procedures. Allelic variants of the 
DNA sequence shown in SEQ ID NO:l, including those 

jo containing silent mutations and those in which mutations 
result in amino acid sequence changes, are within the scope 
of the present invention, as are proteins which are allelic 
variants of SEQ ID NO:2. cDNAs generated from alterna- 
tively spliced mRNAs, which retain the properties of the 

15 zpeplO polypeptide are included within the scope of the 
present invention, as are polypeptides encoded by such 
cDNAs and mRNAs. Allelic variants and splice variants of 
these sequences can be cloned by probing cDNA or. 

2Q genomic libraries from different individuals or tissues 
according to standard procedures known in the art. 

The present invention also provides isolated zpeplO 
polypeptides that are substantially homologous to the 
polypeptide of SEQ ID N 0:2 and its species orthologs. The 

25 term "substantially homologous" is used herein to denote 
polypeptides having 60%, preferably at least 80%, sequence 
identity to the sequences shown in SEQ ID NO:2 or their 
orthologs. Such polypeptides will more preferably be at least 
90% identical, and most preferably 95% or more identical to 

30 SEQ ID NO: 2 or its orthologs. Percent sequence identity is 
determined by conventional methods. See, for example, 
Altschul et al., Dull. Math. Bio. 48: 603-16, 1986 and 
Henikoff and Henikoff, Proc. Natl. Acad. ScL USA 
89:10915-9, 1992. Briefly, two amino acid sequences are 
aligned to optimize the alignment scores using a gap open- 
ing penalty of 10, a gap extension penalty of 1, and the 
"blosum 62" scoring matrix of Henikoff and Henikoff (ibid.) 
as shown in Table 3 (amino acids are indicated by the 

40 standard one-letter codes). The percent identity is then 
calculated as: 

Total number of identical matches ^ ^ 
[length of the longer sequence plus the 
45 number of gaps introduced into the longer 

sequence in order to align the two sequences] 



TABLE 3 





A 


R 


N 


D 


c 


Q 


E 


G 


H 


I 


L 


K 


M 


F 


P S T W Y V 


A 


4 






























R 


-1 


5 




























N 


-2 


0 


6 


























D 


-2 


-2 


1 


6 
























C 


0 


-3 


-3 


-3 


9 






















Q 


-1 




0 


0 


-3 


5 




















E 


-1 


0 


0 


2 


-4 


2 


5 


















G 


0 


-2 


0 


-1 


-3 


-2 


-2 


6 
















H 
I 


-2 
-1 


0 
-3 


-3 


-1 

-3 


-3 
-1 


0 
-3 


0 

-3 


-2 
-4 


8 
-3 


4 












L 


-3 


-2 


-3 


-4 


-1 


-2 


-3 


-4 


-3 


2 


4 










K 


-1 


2 


0 


-1 


-3 


1 


1 


-2 




-3 


-2 


5 








M 


-1 


-1 


-2 


-3 


-1 


0 


-2 


-3 


-2 


1 


2 




5 






F 


-2 


-3 


-3 


-3 


-2 


-3 


-3 


-3 


-1 


0 


0 


-3 


0 


6 




P 


-1 


-2 


-2 


-1 


-3 


-1 


-1 


-2 


-2 


-3 


-3 




-2 


-4 


7 


S 


3 


-1 


1 


0 


-1 


0 


0 


0 


-1 


-2 


-2 


0 


-1 


-2 


-1 4 
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R 


N 


D 


C 


Q 


E 


G 


H 


I 


L 


K M 


F 


P 


S 


T 


W 


Y 


V 


T 


0 


-1 


0 


-1 


-1 


-1 


-1 


-2 


-2 


-1 


-1 


-1 -1 


-2 


-1 


1 


5 








W 


-3 


-3 


-4 


-4 


-2 


-2 


-3 


-2 


-2 


-3 


-2 


-3 -1 


1 


-4 


-3 


-2 


11 
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Sequence identity of polynucleotide molecules is deter- 
mined by similar methods using a ratio as disclosed above. 

Those skilled in the art appreciate that there are many 
established algorithms available to align two amino acid 
sequences. The "FASTA" similarity search algorithm of 
Pearson and Lipman is a suitable protein alignment method 
for examining the level of identity shared by an amino acid 
sequence disclosed herein and the amino acid sequence of a 
putative variant zpeplO. The FASTA algorithm is described 
by Pearson and Lipman, Proc. Nat. Acad, Sci. USA 85:2444, 
1988, and by Pearson, Meth. EnzymoL 183:63, 1990. 

Briefly, FASTA first characterizes sequence similarity by 
identifying regions shared by the query sequence (e.g., SEQ 
ID NO: 2) and a test sequence that have either the highest 
density of identities (if the ktup variable is 1) or pairs of 
identities (if ktup-2), without considering conservative 
amino acid substitutions, insertions, or deletions. The ten 
regions with the highest density of identities are then 
re-scored by comparing the similarity of all paired amino 
acids using an amino acid substitution matrix, and the ends 
of the regions are "trimmed" to include only those residues 
that contribute to the highest score. If there are several 
regions with scores greater than the "cutoff' value 
(calculated by a predetermined formula based upon the 
length of the sequence and the ktup value), then the trimmed 
initial regions are examined to determine whether the 
regions can be joined to form an approximate alignment 
with gaps. Finally, the highest scoring regions of the two 
amino acid sequences are aligned using a modification of the 
Needleman-Wunsch-Sellers algorithm (Needleman and 
Wunsch.J. Mol Biol. 48:444, 1970; Sellers, SIAMJ. Appl. 
Math. 26:787, 1974), which allows for amino acid insertions 
and deletions. Preferred parameters for FASTA analysis are: 
ktup=l, gap opening penalty=10, gap extension penalty=l, 
and substitution matrix-BLOSUM62. These parameters can 
be introduced into a FASTA program by modifying the 
scoring matrix file ("SMATRIX"), as explained in Appendix 
2 of Pearson, Meth. Enzymol. 183:63, 1990. 

FASTA can also be used to determine the sequence 
identity of nucleic acid molecules using a ratio as disclosed 
above. For nucleotide sequence comparisons, the ktup value 
can range between one to six, preferably from three to six, 
most preferably three, with other parameters set as default. 

The BLOSUM62 table is an amino acid substitution 
matrix derived from about 2,000 local multiple alignments 
of protein sequence segments, representing highly con- 
served regions of more than 500 groups of related proteins 
(Henikoff and Henikoff, Proc. Natl. Acad. Sci. USA 
89:10915, 1992). Accordingly, the BLOSUM62 substitution 
frequencies can be used to define conservative amino acid 
substitutions that may be introduced into the amino acid 
sequences of the present invention. Although it is possible to 
design amino acid substitutions based solely upon chemical 
properties (as discussed above), the language "conservative 
amino acid substitution" preferably refers to a substitution 
represented by a BLOSUM62 value of greater than -1. For 
example, an amino acid substitution is conservative if the 
substitution is characterized by a BLOSUM62 value of 0, 1, 



2, or 3. According to this system, preferred conservative 
amino acid substitutions are characterized by a BLOSUM62 
value of at least 1 (e.g., 1, 2 or 3), while more preferred 
conservative amino acid substitutions are characterized by a 

15 BLOSUM62 value of at least 2 (e.g., 2 or 3). 

Substantially homologous proteins and polypeptides are 
characterized as having one or more amino acid 
substitutions, deletions or additions. These changes are 
preferably of a minor nature, that is conservative amino acid 

20 substitutions (see Table 4) and other substitutions that do not 
significantly affect the folding or activity of the protein or 
polypeptide; small deletions, typically of one to about 30 
amino acids; and small amino- or carboxyl-terminal 
extensions, such as an amino -terminal methionine residue, a 

25 small linker peptide of up to about 20-25 residues or an 
affinity tag. Polypeptides comprising affinity tags can further 
comprise a proteolytic cleavage site between the zpeplO 
polypeptide and the affinity tag. Preferred such sites include 
thrombin cleavage sites and factor Xa cleavage sites. 

30 

TABLE 4 

Conservative amino acid substitutions 

Basic: arginine 

35 1 Y sine 

histidine 

Acidic: 
Polar: 



Hydrophobic: 



aspartic acid 



leucine 

isoleucine 

valine 

phenylalanine 

tryptophan 

tyrosine 

glycine 

alanine 

threonine 
methionine 



50 The proteins of the present invention can also comprise 
non-naturally occurring amino acid residues. Non-naturally 
occurring amino acids include, without limitation, trans-3- 
methylproline, 2,4-methanoproline, cis-4-hydroxyproline, 
trans-4-hydroxyproline, N-methyl-glycine, allo-threonine, 

55 methylthreonine, hydroxyethy 1-cysteine, 
hydroxyethylhomocysteine, nitroglutamine, homo- 
glutamine, pipecolic acid, thiazolidine carboxylic acid, 
dehydroproline, 3- and 4-methylproline, 3,3-dimethyl- 
proline, tert-leucine, norvaline, 2-azaphenyl-alanine, 

60 3-azapbenylalanine, 4-azaphenylalanine, and 
4-fiuoropbenyl-alanine. Several methods are known in the 
art for incorporating non-naturally occurring amino acid 
residues into proteins. For example, an in vitro system can 
be employed wherein nonsense mutations are suppressed 

65 using chemically aminoacylated suppressor tRNAs. Meth- 
ods for synthesizing amino acids and aminoacylating tRNA 
are known in the art. Transcription and translation of plas- 
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mids containing nonsense mutations is carried out in a 241:53-57, 1988) or Bowie and Sauer (Proc. Natl Acad. 

cell-free system comprising an E. coli S30 extract and Sci. USA 86:2152-2156, 1989). Briefly, these authors dis- 

commercially available enzymes and other reagents. Pro- close methods for simultaneously randomizing two or more 

teins are purified by chromatography. See, for example, positions in a polypeptide, selecting for functional 

Robertson et al., J. Am. Chem. Soc. 113:2722, 1991; Ellman 5 polypeptide, and then sequencing the mutagenized polypep- 

et al., Methods EnzymoL 202:301, 1991; Chung et al., tides to determine the spectrum of allowable substitutions at 

Science 259:806-9, 1993; and Chung et al., Proc. Natl. each position. Other methods that can be used include phage 

Acad. Sci. USA 90:10145-9, 1993). In a second method, display (e.g., Lowman et al., Biochem. 30:10832-10837, 

translation is carried out in Xenopus oocytes by microin- 1991; Ladner et al., U.S. Pat. No. 5,223,409; Huse, WIPO 

jection of mutated mRNA and chemically aminoacylated 10 Publication WO 92/06204) and region-directed mutagenesis 

suppressor tRNAs (Turcatti et al., J. Biol. Chem. (Derbyshire et al., Gene 46:145, 1986; Ner et al., DNA 

271:19991-8, 1996). Within a third method, E. coli cells are 7:127, 1988). 

cultured in the absence of a natural amino acid that is to be Variants of the disclosed zpeplO DNA and polypeptide 
replaced (e.g., phenylalanine) and in the presence of the sequences can be generated through DNA shuffling as 
desired non-naturally occurring amino acid(s) (e.g., 15 disclosed by Slemmer, Nature 370:389-91, 1994, Stemmer, 
2-azaphenylalanine, 3-azaphenylalanine, Proc. Natl. Acad. Sci. USA 91:10747-51, 1994 and WIPO 
4-azaphenylalanine, or 4-fluorophenylalanine). The non- Publication WO 97/20078. Briefly, variant DNAs are gen- 
naturally occurring amino acid is incorporated into the era ted by in vitro homologous recombination by random 
protein in place of its natural counterpart. See, Koide et al., fragmentation of a parent DNA followed by reassembly 
Biochem. 33:7470-6, 1994. Naturally occurring amino acid 20 using PCR, resulting in randomly introduced point muta- 
residues can be converted to non-naturally occurring species tions. This technique can be modified by using a family of 
by in vitro chemical modification. Chemical modification parent DNAs, such as allelic variants or DNAs from differ- 
can be combined with site-directed mutagenesis to further ent species, to introduce additional variability into the pro- 
expand the range of substitutions (Wynn and Richards, cess. Selection or screening for the desired activity, followed 
Protein Sci. 2:39-403, 1993). 25 by additional iterations of mutagenesis and assay provides 
A limited number of non-conservative amino acids, ammo for rapid "evolution" of sequences by selecting for desirable 
acids that are not encoded by the genetic code, non-naturally mutations while simultaneously selecting against detrimen- 
occurring amino acids, and unnatural amino acids may be tat changes. 

substituted for zpeplO amino acid residues. Mutagenesis methods as disclosed above can be com- 

Essential amino acids in the zpeplO polypeptides of the 30 bined with high-throughput, automated screening methods 

present invention can be identified according to procedures to detect activity of cloned, mutagenized polypeptides in 

known in the art, such as site-directed mutagenesis or host cells. Mutagenized DNA molecules that encode active 

alanine-scanning mutagenesis (Cunningham and Wells, Sci- polypeptides (e.g., ligand binding receptors) can be recov- 

ence 244: 1081-5, 1989). In the latter technique, single ered from the host cells and rapidly sequenced using modern 

alanine mutations are introduced at every residue in the 35 equipment. These methods allow the rapid determination of 

molecule, and the resultant mutant molecules are tested for the importance of individual amino acid residues in a 

biological activity (e.g., adhesion-modulation, polypeptide of interest, and can be applied to polypeptides 

differentiation-modulation or the like) to identify amino acid of unknown structure. 

residues that are critical to the activity of the molecule. See Using the methods discussed above, one of ordinary skill 
also, Hilton et al.,7. Biol. Chem. 271:4699-708, 1996. Sites 40 in the art can identify and/or prepare a variety of polypep- 
of ligand-receptor or other biological interaction can also be tides that are substantially homologous to, for example, 
determined by physical analysis of structure, as determined residues 21 to 111, 21 to 142 or 1 to 142 of SEQ ID NO:2 
by such techniques as nuclear magnetic resonance, or allelic variants thereof and retain the properties of wild- 
crystallography, electron diffraction or pholoaffinity type protein. Such polypeptides may include additional 
labeling, in conjunction with mutation of putative contact 45 amino acids, such as affinity tags and the like. Such polypep- 
site amino acids. See, for example, de Vos et al., Science tides may also include additional polypeptide segments as 
255:306-12, 1992; Smith et al., J. Mol Biol 224:899-904, generally disclosed herein. 

1992; Wlodaver et al., FEBS Lett. 309:59-64, 1992. The The invention also provides soluble polypeptides. It is 
identities of essential amino acids can also be inferred from preferred that these soluble polypeptides be extracellular 
analysis of homologies with related proteins. Amino acid 50 polypeptides and be in a form substantially free of trans- 
residues that might be considered essential in the zpeplO membrane and intracellular polypeptide segments. To direct 
polypeptide are cysteine residues at amino acid residues 17, the export of the soluble polypeptides from the host cell, the 
20, 35, 45, 84, 87, 94 and 100 of SEQ ID NO:2; the potential DNA encoding the soluble polypeptide is linked to a second 
arg-lys-arg tri-basic amino acid cleavage site at amino acid DNA segment encoding a secretory peptide, such as a t-PA 
residues 97-99 of SEQ ID NO:2; the potential N-linked 55 secretory peptide or the native zpeplO secretory signal 
glycosylate sites at amino acid residues 83 and 86 of SEQ sequence (amino acid residues 1-20 of SEQ ID NO:2). To 
ID NO: 2 and the potential O-glycosylation sites at amino facilitate purification of the secreted polypeptide, an N- or 
acid residues 28, 36, 48, 52, 60, 65, 68, 78, 79, 80, 85, 86, C-terminal extension, such as an affinity tag or another 
90, 93 and 104 of SEQ ID NO:2. A hydrophobicity profile polypeptide or protein for which an antibody or other 
of SEQ ID NO:2 is shown in the attached FIGURE. Those 60 specific binding agent is available, can be fused to the 
skilled in the art will recognize that this hydrophobicity will soluble polypeptide. 

be taken into account when designing alterations in the The present invention also provides zpeplOfusion pro- 

amino acid sequence of a zpeplO polypeptide, so as not to teins. For example, fusion proteins of the present invention 

disrupt the overall profile. encompass 

Multiple amino acid substitutions can be made and tested 65 (1) a polypeptide selected from the following: a) a 

using known methods of mutagenesis and screening, such as polypeptide comprising a sequence of amino acid resi- 

those disclosed by Reidhaar-Olson and Sauer {Science dues from amino acid residue 21 to amino acid residue 
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111 of SEQ ID NO:2; and b) a polypeptide comprising DNA may be provided by integration into the host cell 

a sequence of amino acid residues from amino acid genome. Selection of promoters, terminators, selectable 

residue 1 to amino acid residue 20 of SEQ ID NO:2; markers, vectors and other elements is a matter of routine 

and design within the level of ordinary skill in the art. Many such 

(2) another polypeptide. The other polypeptide may be a s elements are described in the literature and are available 

signal peptide to facilitate secretion of the fusion through commercial suppliers. 

protein, a transmembrane and/or cytoplasmic domain, To direct a zpeplO polypeptide into the secretory pathway 
or another soluble polypeptide or the like. For example, of a host cell, a secretory signal sequence (also known as a 
the extracellular portion of a zpeplO polypeptide can be signal sequence, leader sequence, prepro sequence or pre 
prepared as a fusion to a dimerizing protein as dis- 10 sequence) is provided in ihe expression vector. The secretory 
closed in U.S. Pat. Nos. 5,155,027 and 5,567,584. signal sequence may be that of the zpeplO polypeptide, or 
Preferred dimerizing proteins in this regard include may be derived from another secreted protein (e.g., t-PA) or 
immunoglobulin constant region domains. synthesized de novo. The secretory signal sequence is joined 
Immunoglobulin-zpeplO polypeptide fusions can be to the zpeplO DNA sequence in the correct reading frame 
expressed in genetically engineered cells to produce a 15 and positioned to direct newly synthesized polypeptide into 
variety of multimeric zpeplO analogs. Auxiliary secretory pathways to host cell. Secretory signal sequences 
domains can be fused to zpeplO polypeptides to target are commonly positioned 5' to the DNA sequence encoding 
them to specific cells, tissues, or macromolecules. For the polypeptide of interest, although certain secretory signal 
example, a soluble zpeplO polypeptide or protein could sequences may be positioned elsewhere in the DNA 
be targeted to a predetermined cell type by fusing a 20 sequence of interest (see, e.g., Welch et al., U.S. Pat. No. 
zpeplO polypeptide to a ligand that specifically binds to 5,037,743; Holland et al., U.S. Pat. No. 5,143,830). 
a receptor on the surface of the target cell. In this way, Alternatively, the secretory signal sequence contained in 
polypeptides and proteins can be targeted for therapeu- the polypeptides of the present invention is used to direct 
tic or diagnostic purposes. A zpeplO polypeptide can be other polypeptides into the secretory pathway. The present 
fused to two or more moieties, such as an affinity tag for 25 invention provides for such fusion polypeptides. A signal 
purification and a targeting domain. Polypeptide fusion polypeptide can be made wherein a secretory signal 
fusions can also comprise one or more cleavage sites, sequence derived from amino acid residues 1-20 of SEQ ID 
particularly between domains. See, Tuan et al., Con- NO:2 is be operably linked to another polypeptide using 
nective Tissue Research 34:1-9, 1996. methods known in the art and disclosed herein. The secre- 
The soluble zpeplO polypeptide is useful in studying the 30 tory signal sequence contained in the fusion polypeptides of 
distribution of zpeplO receptors on tissues or specific cell the present invention is preferably fused amino -terminally to 
lineages, and to provide insight into receptor/ligand biology. an additional peptide to direct the additional peptide into the 
Using labeled soluble zpeplO, cells expressing the ligand are secretory pathway. Such constructs have numerous applica- 
identified by fluorescence immunocytometry or immunohis- tions known in the art. For example, these novel secretory 
tochemistry. The effects of zpeplO on steroidogenesis or 35 signal sequence fusion constructs can direct the secretion of 
Leydig or Sertoli cell expression can be examined by an active component of a normally non-secreted protein, 
probing tissue slices with soluble zpeplO fusions, see for such as a receptor. Such fusions may be used in vivo or in 
example, Daehlin et al., Scand. J. Urol. Nephrol. 19:7-12, vitro to direct peptides through the secretory pathway. 
1985; Gavino et al., Arch. Biochem. Biophys. 233:741-7, Cultured mammalian cells are suitable hosts within the 
1984 and von Schnakenburg et al., Acta Endocrinol 40 present invention. Methods for introducing exogenous DNA 
94:397-403, 1980). luteinizing hormone (LH) and follicle into mammalian host cells include calcium phosphate- 
stimulating hormone (FSH) responses could also be exam- mediated transfection (Wigler et al., Cell 14:725, 1978; 
ined in soluble zpeplO-treated tissue slices. Corsaro and Pearson, Somatic Cell Genetics 7:603, 1981: 
The polypeptides of the present invention, including Graham and Van der Eb, Virology 52:456, 1973), eleclropo- 
full-length proteins, fragments thereof and fusion proteins, 45 ration (Neumann et al., EMBO J. 1:841-845, 1982), DEAE- 
can be produced in genetically engineered host cells accord- dextran mediated transfection (Ausubel et al., eds., Current 
ing to conventional techniques. Suitable host cells are those Protocols in Molecular Biology, John Wiley and Sons, Inc., 
cell types that can be transformed or transfected with NY, 1987), liposome-mediated transfection (Hawley-Nelson 
exogenous DNA and grown in culture, and include bacteria, et al., Focus 15:73, 1993; Ciccaronc et al., Focus 15:80, 
fungal cells, and cultured higher eukaryotic cells. Eukaryotic 50 1993), and viral vectors (Miller and Rosman, BioTechniques 
cells, particularly cultured cells of multicellular organisms, 7:980-90, 1989; Wang and Finer, Nature Med. 2:714-16, 
are preferred. Techniques for manipulating cloned DNA 1996). The production of recombinant polypeptides in cul- 
molecules and introducing exogenous DNA in to a variety of tured mammalian cells is disclosed, for example, by 
host cells are disclosed by Sambrook et al., Molecular Levinson et al., U.S. Pat. No. 4,713339; Hagen et al., U.S. 
Cloning: A Laboratory Manual, 2nd ed., Cold Spring Har- 55 Pat. No. 4,784,950; Palmiter et al., U.S. Pat. No. 4,579,821; 
bor Laboratory Press, Cold Spring Harbor, N.Y., 1989, and and Ringold, U.S Pat. No. 4,656,134. Suitable cultured 
Ausubel et al. (eds.), Current Protocols in Molecular mammalian cells include the COS-1 (ATCC No. CRL 1650), 
Biology, John Wiley and Sons, Inc., NY, 1987. COS-7 (ATCC No. CRL 1651), BHK 570 (ATCC No. CRL 
In general, a DNA sequence encoding a zpeplO polypep- 10314), 293 (ATCC No. CRL 1573; Graham et al., J. Gen. 
tide of the present invention is operably linked to other 60 Virol. 36:59-72, 1977) and Chinese hamster ovary (e.g. 
genetic elements required for its expression, generally CHO-K1; ATCC No. CCL 61) cell lines. Additional suitable 
including a transcription promoter and terminator within an cell lines are known in the art and available from public 
expression vector. The vector will also commonly contain depositories such as the American Type Culture Collection, 
one or more selectable markers and one or more origins of Rockville, Md. In general, strong transcription promoters 
replication, although those skilled in the art will recognize 65 are preferred, such as promoters from SV-40 or cytomega- 
that within certain systems selectable markers may be pro- lovims. See, e.g., U.S. Pat. No. 4,956,288. Other suitable 
vided on separate vectors, and replication of the exogenous promoters include those from metallolhionein genes (U.S. 
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Pat. Nos. 4,579,821 and 4,601,978) and the adenovirus 
major late promoter. 

Drug selection is generally used to select for cultured 
mammalian cells into which foreign DNA has been inserted. 
Such cells are commonly referred to as "transfectants". Cells 
that have been cultured in the presence of the selective agent 
and are able to pass the gene of interest to their progeny are 
referred to as "stable transfectants." A preferred selectable 
marker is a gene encoding resistance to the antibiotic 
neomycin. Selection is carried out in the presence of a 
neomycin-type drug, such as G-418 or the like. Selection 
systems may also be used to increase the expression level of 
the gene of interest, a process referred to as "amplification." 
Amplification is carried out by culturing transfectants in the 
presence of a low level of the selective agent and then 
increasing the amount of selective agent to select for cells 
that produce high levels of the products of the introduced 
genes. A preferred amplifiable selectable marker is dihydro- 
folate reductase, which confers resistance to methotrexate. 
Other drug resistance genes (e.g., hygromycin resistance, 
multi-drug resistance, puromycin acetyltransferase) can also 
be used. Alternative markers that introduce an altered 
phenotype, such as green fluorescent protein, or cell surface 
proteins such as CD4, CD8, Class I MHC, placental alkaline 
phosphatase may be used to sort transfected cells from 
untransfcctcd cells by such means as FACS sorting or 
magnetic bead separation technology. 

Other higher eukaryotic cells can also be used as hosts, 
including plant cells, insect cells and avian cells. The use of 
Agrobacterium rhizogenes as a vector for expressing genes 
in plant cells has been reviewed by Sinkar et al., J. Biosci. 
(angalore) 11:47-58, 1987. Transformation of insect cells 
and production of foreign polypeptides therein is disclosed 
by Guarino et al., U.S. Pat. No. 5,162,222 and WIPO 
publication WO 94/06463. Insect cells can be infected with 
recombinant baculovirus; commonly derived from 
Autographa californica nuclear polyhedrosis virus 
(AcNPV). DNA encoding the zpeplO polypeptide is inserted 
into the baculoviral genome in place of the AcNPV poly- 
hedrin gene coding sequence by one of two methods. The 
first is the traditional method of homologous DNA recom- 
bination between wild- type AcNPV and a transfer vector 
containing the zpeplO flanked by AcNPV sequences. Suit- 
able insect cells, e.g. SF9 cells, are infected with wild-type 
AcNPV and transfected with a transfer vector comprising a 
zpeplO polynucleotide operably linked to an AcNPV poly- 
hedrin gene promoter, terminator, and flanking sequences. 
See, King and Possee, The Baculovirus Expression System: 
A Laboratory Guide, London, Chapman & Hall; O'Reilly et 
al., Baculovirus Expression Vectors: A Laboratory Manual, 
New York, Oxford University Press., 1994; and, Richardson, 
Ed., Baculovirus Expression Protocols. Methods in Molecu- 
lar Biology, Totowa, N.J., Humana Press, 1995. Natural 
recombination within an insect cell will result in a recom- 
binant baculovirus which contains zpeplO driven by the 
polybedrin promoter. Recombinant viral stocks are made by 
methods commonly used in the art. 

The second method of making recombinant baculovirus 
utilizes a transposon-based system described by Luckow et 
al. (J. Virol 67:4566-79, 1993). This system is sold in the 
Bac-to-Bac kit (Life Technologies, Rockville, Md.). This 
system utilizes a transfer vector, pFastBacl™ (Life 
Technologies) containing a Tn7 transposon to move the 
DNA encoding the zpeplO polypeptide into a baculovirus 
genome maintained in E. coli as a large plasmid called a 
"bacmid." The pFastBacl™ transfer vector utilizes the 
AcNPV polyhedrin promoter to drive the expression of the 
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gene of interest, in this case zpeplO. However, pFastBacl™ 
can be modified to a considerable degree. The polyhedrin 
promoter can be removed and substituted with the baculovi- 
rus basic protein promoter (also known as Pcor, p6.9 or MP 

5 promoter) which is expressed earlier in the baculovirus 
infection, and has been shown to be advantageous for 
expressing secreted proteins. See, Hill-Perkins and Possee, 
J. Gen, Virol 71:971-6, 1990; Bonning et al.,/. Gen. Virol. 
75:1551-6, 1994; and, Chazenbalk and Rapoport, J. Biol 

10 Chem. 270:1543-9, 1995. In such transfer vector constructs, 
a short or long version of the basic protein promoter can be 
used. Moreover, transfer vectors can be constructed which 
replace the native zpeplO secretory signal sequences with 
secretory signal sequences derived from insect proteins. For 

15 example, a secretory signal sequence from Ecdysteroid 
Glucosyltransferase (EGT), honey bee Melittin (Invitrogen, 
Carlsbad, Calif.), or baculovirus gp67 (PharMingen, San 
Diego, Calif.) can be used in constructs to replace the native 
secretory signal sequence. In addition, transfer vectors can 

20 include an in-frame fusion with DNA encoding an epitope 
tag at the C- or N-terminus of the expressed zpeplO 
polypeptide, for example, a Glu-Glu epitope tag 
(Grussenmeyer et al., ibid.) or FLAG tag (Kodak). Using a 
technique known in the art, a transfer vector containing 

25 zpeplO is transformed into E. coli, and screened for bacmids 
which contain an interrupted lacZ gene indicative of recom- 
binant baculovirus. The bacmid DNA containing the recom- 
binant baculovirus genome is isolated, using common 
techniques, and used to transfect Spodoptera frugiperda 

30 cells, e.g. Sf9 cells. Recombinant virus that expresses 
zpeplO is subsequently produced. Recombinant viral stocks 
are made by methods commonly used the art. 

The recombinant virus is used to infect host cells, typi- 
cally a cell line derived from the fall armyworm, Spodoptera 

35 frugiperda. See, in general, Glick and Pasternak, Molecular 
Biotechnology: Principles and Applications of Recombinant 
DNA, ASM Press, Washington, D.C., 1994. Another suitable 
cell line is the High FiveO™ cell line (Invitrogen) derived 
from Trichoplusia ni (U.S. Pat No. 5,300,435). Commer- 

40 cially available serum-free media are used to grow and 
maintain the cells. Suitable media are Sf900 II™ (Life 
Technologies) or ESF 921 ™ (Expression Systems) for the 
Sf9 cells; and Ex-cellO405™ (JRH Biosciences, Lenexa, 
Kans.) or Express FiveO™ (Life Technologies) for the T. ni 

45 cells. The cells are grown up from an inoculation density of 
approximately 2-5xl0 5 cells to a density of l-2xl0 6 cells at 
which time a recombinant viral stock is added at a multi- 
plicity of infection (MOI) of 0.1 to 10, more typically near 
3. The recombinant virus- infected cells typically produce 

so the recombinant zpeplO polypeptide at 12-72 hours post- 
infection and secrete it with varying efficiency into the 
medium. The culture is usually harvested 48 hours post- 
infection. Centrifugation is used to separate the cells from 
the medium (supernatant). The supernatant containing the 

55 zpeplO polypeptide is filtered through micropore filters, 
usually 0.45 pan pore size. Procedures used are generally 
described in available laboratory manuals (King and Possee, 
ibid.; O'Reilly et al., ibid.; Richardson, C. D., ibid.). Sub- 
sequent purification of the zpeplO polypeptide from the 

60 supernatant can be achieved using methods described 
herein. 

Fungal cells, including yeast cells, can also be used within 
the present invention. Yeast species of particular interest in 
this regard include Saccharomyces cerevisiae, Pichia 
65 pastoris, and Pichia methanolica. Methods for transforming 
S. cerevisiae cells with exogenous DNA and producing 
recombinant polypeptides therefrom are disclosed by, for 
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example, Kawasaki, U.S. Pat. No. 4,599,311; Kawasaki et bacteria such as£. coli, the polypeptide may be retained in 

al., U.S. Pat. No. 4,931,373; Brake, U.S. Pat. No. 4,870,008; the cytoplasm, typically as insoluble granules, or may be 

Welch el al., U.S. Pat. No. 5,037,743; aud Murray et al., U.S. directed to the periplasmic space by a bacterial secretion 

Pat. No. 4,845,075. Transformed cells are selected by phe- sequence. In the former case, the cells are lysed, and the 

notype determined by the selectable marker, commonly drug 5 granules are recovered and denatured using, for example, 

resistance or the ability to grow in the absence of a particular guanidine isothiocyanate or urea. The denatured polypeptide 

nutrient (e.g., leucine). A preferred vector system for use in can then be refolded and dimerized by diluting the 

Saccharomyces cerevisiae is the POT1 vector system dis- denaturant, such as by dialysis against a solution of urea and 

closed by Kawasaki et al. (U.S. Pat. No. 4,931,373), which a combination of reduced and oxidized glutathione, fol- 

allows transformed cells to be selected by growth in io lowed by dialysis against a buffered saline solution. In the 

glucose-containing media. Suitable promoters and termina- latter case, the polypeptide can be recovered from the 

tors for use in yeast include those from glycolytic enzyme periplasmic space in a soluble and functional form by 

genes (see, e.g., Kawasaki, U.S. Pat. No. 4,599,311; Kings- disrupting the cells (by, for example, sonication or osmotic 

man et al., U.S. Pat. No. 4,615,974; and Bitter, U.S. Pat. No. shock) to release the contents of the periplasmic space and 

4,977,092) and alcohol dehydrogenase genes. See also U.S. 15 recovering the protein, thereby obviating the need for dena- 

Pat. Nos. 4,990,446; 5,063,154; 5,139,936 and 4,661,454. turation and refolding. 

Transformation systems for other yeasts, including Transformed or transfected host cells are cultured accord- 

Hansenula polymorpha, Schizosaccharomyces pombe, ing to conventional procedures in a culture medium con- 

Kluyveromyces lactis, Kluyveromyces fragilis, Ustilago taining nutrients and other components required for the 

maydis, Pichia pastoris, Pichia methanolica, Pichia guill- 20 growth of the chosen host cells. A variety of suitable media, 

ermondii and Candida maltosa are known in the art. See, for including defined media and complex media, are known in 

example, Gleeson et al., J. Gen. Microbiol 132:3459-65, the art and generally include a carbon source, a nitrogen 

1986 and Cregg, U.S. Pat. No. 4,882,279. Aspergillus cells source, essential amino acids, vitamins and minerals. Media 

may be utilized according to the methods of McKnightet al., may also contain such components as growth factors or 

U.S. Pat. No. 4,935,349. Methods for transforming A cremo- 25 serum, as required. The growth medium will generally select 

nium chrysogenum are disclosed by Sumino et al., U.S. Pat. for cells containing the exogcnously added DNA by, for 

No. 5,162,228. Methods for transforming Neurospora are example, drug selection or deficiency in an essential nutrient 

disclosed by Lambowitz, U.S. Pat. No. 4,486,533. which is complemented by the selectable marker carried on 

The use of Pichia methanolica as host for the production the expression vector or co-transfected into the host cell. P. 

of recombinant proteins is disclosed in WIPO Publications 30 methanolica cells are cultured in a medium comprising 

WO 97/17450, WO 97/17451, WO 98/02536, and WO adequate sources of carbon, nitrogen and trace nutrients at a 

98/02565. DNA molecules for use in transforming/! metha- temperature of about 25° C. to 35° C. Liquid cultures are 

nolica will commonly be prepared as double-stranded, cir- provided with sufficient aeration by conventional means, 

cular plasmids, which are preferably linearized prior to such as shaking of small flasks or sparging of fermentors. A 

transformation. For polypeptide production in P. 35 preferred culture medium for P. methanolica is YEPD (2% 

methanolica, it is preferred that the promoter and terminator D-glucose, 2% Bacto™ Peptone (Difco Laboratories, 

in the plasmid be that of a P. methanolica gene, such as a P. Detroit, Mich.), 1% Bacto™ yeast extract (Difco 

methanolica alcohol utilization gene (AUG1 or AUG2). Laboratories), 0.004% adenine and 0.006% L-leucine). 

Other useful promoters include those of the dihydroxyac- ZpeplO polypeptides or fragments thereof may also be 

etone synthase (DHAS), formate dehydrogenase (FMD), 40 prepared through chemical synthesis. ZpeplO polypeptides 

and catalase (CAT) genes. To facilitate integration of the may be monomers or multimers; glycosylated or non- 

DNA into the host chromosome, it is preferred to have the glycosylated; pegylated or non-pegylated; and may or may 

entire expression segment of the plasmid flanked at both not include an initial methionine amino acid residue, 

ends by host DNA sequences. A preferred selectable marker Expressed recombinant zpeplO polypeptides (or chimeric 

for use in Pichia methanolica is a P. methanolica ADE2 45 zpeplO polypeptides) can be purified using fractionation 

gene, which encodes phosphoribosyl-5-aminoimidazole car- and/or conventional purification methods and media, 

boxylase (AIRC; EC 4.1.1.21), which allows ade2 host cells Ammonium sulfate precipitation and acid or chaotrope 

to grow in the absence of adenine. For large-scale, industrial extraction may be used for fractionation of samples. Exem- 

processcs where it is desirable to minimize the use of plary purification steps may include hydroxy apatite, size 

methanol, it is preferred to use host cells in which both 50 exclusion, FPLC and reverse-phase high performance liquid 

methanol utilization genes (AUG 1 and AUG2) are deleted. chromatography. Suitable anion exchange media include 

For production of secreted proteins, host cells deficient in derivatized dextrans, agarose, cellulose, polyacrylamide, 

vacuolar protease genes (PEP4 and PRB1) are preferred. specialty silicas, and the like. DEAE Fast-Flow Sepharose 

Electroporation is used to facilitate the introduction of a (Pharmacia, Piscataway, NJ.), PE1, DEAE, QAE and Q 

plasmid containing DNA encoding a polypeptide of interest 55 derivatives are preferred. Exemplary chromatographic 

into P. methanolica cells. It is preferred to transform P. media include those media derivatized with phenyl, butyl, or 

methanolica cells by electroporation using an exponentially octyl groups, such as Phenyl-Sepharose FF (Pharmacia), 

decaying, pulsed electric field having a field strength of from Toyopearl butyl 650 (Toso Haas, Montgomeryville, Pa.), 

2.5 to 4.5 kV/cm, preferably about 3.75 kV/cm, and a time Octyl-Sepharose (Pharmacia) and the like; or polyacrylic 

constant (t) of from 1 to 40 milliseconds, most preferably 60 resins, such as Amberchrom CG 71 (Toso Haas) and the like, 

about 20 milliseconds. Suitable solid supports include glass beads, silica-based 

Prokaryotic host cells, including strains of the bacteria resins, cellulosic resins, agarose beads, cross-linked agarose 
Escherichia, Bacillus and other genera are also useful host beads, polystyrene beads, cross-linked polyacrylamide res- 
cells within the present invention. Techniques for transform- ins and the like that are insoluble under the conditions in 
ing these hosts and expressing foreign DNA sequences 65 which they are to be used. These supports may be modified 
cloned therein are well known in the art (see, e.g., Sambrook with reactive groups that allow attachment of proteins by 
et al., ibid.). When expressing a zpeplO polypeptide in amino groups, carboxyl groups, sulfhydryl groups, hydroxyl 
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groups and/or carbohydrate moieties. Examples of coupling injected cells from a few hours or days (naked cells) to 

chemistries include cyanogen bromide activation, several weeks (embedded cells). 

N-hydroxysuccinimide activation, epoxide activation, sulf- Alginate threads provide a simple and quick means for 

hydryl activation, hydrazide activation, and carboxyl and generating embedded cells. The materials needed to gener- 

amino derivatives for carbodiimide coupling chemistries. 5 ate the alginate threads are readily available and relatively 

These and other solid media are well known and widely used inexpensive. Once made, the alginate threads are relatively 

in the art, and are available from commercial suppliers. strong and durable, both in vitro and, based on data obtained 

Methods for binding receptor polypeptides to support media using the threads, in vivo. The alginate threads are easily 

are well known in the art. Selection of a particular method manipulable and the methodology is scalable for preparation 

is a matter of routine design and is determined in part by the 10 of numerous threads. In an exemplary procedure, 3% algi- 

properties of the chosen support. See, for example, Affinity nate is prepared in sterile H 2 0, and sterile filtered. Just prior 

Chromatography: Principles & Methods, Pharmacia LKB to preparation of alginate threads, the alginate solution is 

Biotechnology, Uppsala, Sweden, 1988. again filtered. An approximately 50% cell suspension 

The zpeplO polypeptides of the present invention can be (containing about 5x10* to about 5xl0 7 cells/ml) is mixed 

isolated by exploitation of their structural features. Within 15 with the 3% alginate solution. One ml of the alginate/cell 

one embodiment of the invention are included a fusion of the suspension is extruded into a 100 mM sterile filtered CaCl 2 

polypeptide of interest and an affinity tag (e.g., solution over a time period of -1 5 min, forming a "thread", 

polyhistidine, Glu — Glu, FLAG, maltose-binding protein, The extruded thread is then transferred into a solution of 50 

an immunoglobulin domain) that may be constructed to mM CaCl 2 , and then into a solution of 25 mM CaCl 2 . The 

facilitate purification. 20 thread is then rinsed with deionized water before coating the 

Protein refolding (and optionally reoxidation) procedures thread by incubating in a 0.01% solution of poly-L-lysine. 

may be advantageously used. It is preferred to purify the Finally, the thread is rinsed with Lactated Ringer's Solution 

protein to >80% purity, more preferably to >90% purity, and drawn from solution into a syringe barrel (without 

even more preferably >95%, and particularly preferred is a needle attached). A large bore needle is then attached to the 

pharmaceutically pure state, that is greater than 99.9% pure 25 syringe, and the thread is intraperitoneally injected into a 

with respect to contaminating macro molecules, particularly recipient in a minimal volume of the Lactated Ringer's 

other proteins and nucleic acids, and free of infectious and Solution. 

pyrogenic agents. Preferably, a purified protein is substan- An alternative in vivo approach for assaying soluble 

tially free of other proteins, particularly other proteins of proteins of the present invention involves viral delivery 

animal origin. 30 systems. Exemplary viruses for this purpose include 

Proteins/polypeptides which bind zpeplO (such as a adenovirus, herpesvirus, vaccinia virus and adeno- 
zpeplO-binding receptor or other membrane glycoprotein) associated virus (AAV). Adenovirus, a double-stranded 
can also be used for purification of zpeplO. The zpeplO- DNA vims, is currently the best studied gene transfer vector 
binding protein/polypeptide is immobilized on a solid for delivery of heterologous nucleic acid (for a review, see 
support, such as beads of agarose, cross-linked agarose, 35 Becker et al., Metn. Cell Biol. 43:161-89, 1994; and Dou- 
glass, cellulosic resins, silica-based resins, polystyrene, glas and Curiel, Science & Medicine 4:44-53, 1997). The 
cross-linked polyacrylamide, or like materials that are stable adenovirus system offers several advantages: adenovirus can 
under the conditions of use. Methods for finking polypep- (i) accommodate relatively large DNA inserts; (ii) be grown 
tides to solid supports are known in the art, and include to high-titer; (iii) infect a broad range of mammalian cell 
amine chemistry, cyanogen bromide activation, 40 types; and (iv) be used with a large number of available 
N-hydroxysuccinimide activation, epoxide activation, sulf- vectors containing different promoters. Also, because aden- 
hydryl activation, and hydrazide activation. The resulting oviruses are stable in the bloodstream, they can be admin- 
medium will generally be configured in the form of a istered by intravenous injection. Some disadvantages 
column, and fluids containing zpeplO polypeptide are (especially for gene therapy) associated with adenovirus 
passed through the column one or more times to allow 45 gene delivery include: (i) very low efficiency integration into 
zpeplO polypeptide to bind to the ligand -binding or receptor the host genome; (ii) existence in primarily episomal form; 
polypeptide. The bound zpeplOpolypeptide is then eluted and (iii) the host immune response to the administered virus, 
using changes in salt concentration, chaotropic agents precluding readministration of the adenoviral vector, 
(guanidinc HC1), or pH to disrupt ligand-rcccptor binding. By deleting portions of the adenovirus genome, larger 

In vitro and in vivo response to soluble zpeplO can also so inserts (up to 7 kb) of heterologous DNA can be accommo- 
be measured using cultured cells or by administering mol- dated. These inserts can be incorporated into the viral DNA 
ecules of the claimed invention to the appropriate animal by direct ligation or by homologous recombination with a 
model. For instance, soluble zpeplO transfected expression co-transfected plasmid. In an exemplary system, the essen- 
host cells may be embedded in an alginate environment and tial El gene has been deleted from the viral vector, and the 
injected (implanted) into recipient animals. Alginate-poly- 55 virus will not replicate unless the El gene is provided by the 
L-lysine micro-encapsulation, permseleclive membrane host cell (the human 293 cell line is exemplary). When 
encapsulation and diffusion chambers have been described intravenously administered to intact animals, adenovirus 
as a means to entrap transfected mammalian cells or primary primarily targets the liver. If the adenoviral delivery system 
mammalian cells. These types of non-immunogenic "encap- has an El gene deletion, the virus cannot replicate in the host 
sulations" or microenvironments permit the transfer of nutri- 60 cells. However, the host's tissue (e.g., liver) will express and 
ents into the microenvironment, and also permit the diffu- process (and, if a secretory signal sequence is present, 
sion of proteins and other macromolecules secreted or secrete) the heterologous protein. Secreted proteins will 
released by the captured cells across the environmental enter the circulation in the highly vascularized liver, and 
barrier to the recipient animal. Most importantly, the cap- effects on the infected animal can be determined, 
sules or microenvironments mask and shield the foreign, 65 The adenovirus system can also be used for protein 
embedded cells from the recipient animal's immune production in vitro. By culturing adenovirus-infected non- 
response. Such microenvironments can extend the life of the 293 cells under conditions where the cells are not rapidly 
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dividing, the cells can produce proteins for extended periods include antibodies that bind with zpeplO, in particular the 

of time. For instance, BHK cells are grown to confluence in extracellular domain of zpeplO (amino acid residues 21-111 

cell factories, then exposed to the adenoviral vector encod- of SEQ ID NO:2). 

ing the secreted protein of interest. The cells are then grown Anti-zpeplO antibodies can be produced using antigenic 
under serum-free conditions, which allows infected cells to 5 zpeplO epitope-bearing peptides and polypeptides. Anti- 
survive for several weeks without significant cell division. genie epitope-bearing peptides and polypeptides of the 
Alternatively, adenovirus vector infected 293S cells can be present invention contain a sequence of at least nine, pref- 
grown in suspension culture at relatively high cell density to erably between 15 to about 30 amino acids contained within 
produce significant amounts of protein (see Garnier et al., SEQ ID NO:2. However, peptides or polypeptides compris- 
Cytotechnol. 15:145-55, 1994). With either protocol, an io ing a larger portion of an amino acid sequence of the 
expressed, secreted heterologous protein can be repeatedly invention, containing from 30 to 50 amino acids, or any 
isolated from the cell culture supernatant. Within the length up to and including the entire amino acid sequence of 
infected 293S cell production protocol, non-secreted pro- a polypeptide of the invention, also are useful for inducing 
teins may also be effectively obtained. antibodies that bind with zpeplO. It is desirable that the 
An assay system that uses a ligand-bioding receptor (or an 15 amino acid sequence of the epitope-bearing peptide is 
antibody, one member of a complement/an ti -complement selected to provide substantial solubility in aqueous solvents 
pair) or a binding fragment thereof, and a commercially (i.e., the sequence includes relatively hydrophilic residues, 
available biosensor instrument (BIAcore™, Pharmacia while hydrophobic residues are preferably avoided). The 
Biosensor, Piscataway, N.J.) may be advantageously hydrophobicity plot provided in the FIGURE provides such 
employed. Such receptor, antibody, member of a 20 information. Using the plot antigenic regions can be 
complement/anti-complement pair or fragment is immobi- selected, such as those found in the fragments, amino acid 
lized onto the surface of a receptor chip. Use of this residue 39-44, 65-70, 38-43, 62-67 and 96-101 of SEQ ID 
instrument is disclosed by Karlsson, J. Immunol Methods NO:2. Moreover, amino acid sequences containing proline 
145:229-40, 1991 and Cunningham and Wells, J. Mol. Biol. residues may be also be desirable for antibody production. 
234:554-63, 1993. A receptor, antibody, member or frag- 25 Polyclonal antibodies to recombinant zpeplOprotein or to 
mcnt is covalcntly attached, using amine or sulfhydryl zpeplO isolated from natural sources can be prepared using 
chemistry, to dextran fibers that are attached to gold film methods well-known to those of skill in the art. See, for 
within the flow cell. A test sample is passed through the cell. example, Green et al., "Production of Polyclonal Antisera," 
If a ligand, epitope, or opposite member of the complement/ in Immunochemical Protocols (Manson, ed.), pages 1-5 
anti-complement pair is present in the sample, it will bind to 30 (Humana Press 1992), and Williams et al., "Expression of 
the immobilized receptor, antibody or member, respectively, foreign proteins in E. coli using plasmid vectors and puri- 
causing a change in the refractive index of the medium, fication of specific polyclonal antibodies," in DNA Cloning 
which is detected as a change in surface plasmon resonance 2: Expression Systems, 2nd Edition, Glover et al. (eds.), 
of the gold film. This system allows the determination of on- page 15 (Oxford University Press 1995). The immunoge- 
and off-rates, from which binding affinity can be calculated, 35 nicity of a zpeplO polypeptide can be increased through the 
and assessment of stoichiometry of binding. As used herein, use of an adjuvant, such as alum (aluminum hydroxide) or 
the term complcmcnt/anti-complement pair denotes non- Freund's complete or incomplete adjuvant. Polypeptides 
identical moieties that form a non-covalently associated, useful for immunization also include fusion polypeptides, 
stable pair under appropriate conditions. For instance, biolin such as fusions of zpeplO or a portion thereof with an 
and avidin (or streptavidin) are prototypical members of a 40 immunoglobulin polypeptide or with maltose binding pro- 
complement/anti-complement pair. Other exemplary tein. The polypeptide immunogen may be a full-length 
complement/anti-complement pairs include receptor/ligand molecule or a portion thereof. If the polypeptide portion is 
pairs, antibody/antigen (or hapten or epitope) pairs, sense/ "hapten-like," such portion may be advantageously joined or 
antisense polynucleotide pairs, and the like. Where subse- linked to a macromolecular carrier (such as keyhole limpet 
quent dissociation of the complement/anti-complemenl pair 45 hemocyanin (KLH), bovine serum albumin (BSA) or tetanus 
is desirable, the complement/anti-complement pair prefer- toxoid) for immunization. 

ably has a binding affinity of <10 9 M" 1 . Although polyclonal antibodies are typically raised in 

ZpeplO polypeptide and other ligand homologs can also animals such as horses, cows, dogs, chicken, rats, mice, 

be used within other assay systems known in the art. Such rabbits, hamsters, guinea pigs, goats or sheep, an anti- 

systems include Scatchard analysis for determination of 50 zpeplO antibody of the present invention may also be 

binding affinity (see Scatchard, Ann. NY Acad. Sci. 51: derived from a subhuman primate antibody. General tech- 

660-72, 1949) and calorimetric assays (Cunningham et al., niques for raising diagnostically and therapeutically useful 

Science 253:545-8, 1991; Cunningham et al., Science antibodies in baboons may be found, for example, in Gold- 

245:821-5, 1991). enberg et al., international patent publication No. WO 

The invention also provides anti-zpeplO antibodies. Anti- 55 91/11465, and in Losman et al., Int. J. Cancer 46:310, 1990. 

bodies to zpeplO can be obtained, for example, using as an Antibodies can also be raised in transgenic animals such as 

antigen the product of a zpeplO expression vector, or zpeplO transgenic sheep, cows, goats or pigs, and may be expressed 

isolated from a natural source. Particularly useful anti- in yeast and fungi in modified forms as will as in mammalian 

zpeplO antibodies "bind specifically" with zpeplO. Antibod- and insect cells. 

ies are considered to be specifically binding if the antibodies 60 Alternatively, monoclonal anti-zpeplOantibodies can be 
bind to a zpeplO polypeptide, peptide or epitope with a generated. Rodent monoclonal antibodies to specific anti- 
binding affinity (KJ of 10 6 M" 1 or greater, preferably 10 7 gens may be obtained by methods known to those skilled in 
M" 1 or greater, more preferably 10 8 M" 1 or greater, and most the art (see, for example, Kohler et al., Nature 256:495 
preferably 10 9 M" 1 or greater. The binding affinity of an (1975), Coligan cl al. (eds.), Current Protocols in 
antibody can be readily determined by one of ordinary skill 65 Immunology, Vol 1, pages 2.5.1-2.6.7 (John Wiley & Sons 
in the art, for example, by Scatchard analysis (Scatchard, 1991), Picksley et al., "Production of monoclonal antibodies 
Ann. NY Acad. Sci. 51:660, 1949). Suitable antibodies against proteins expressed in E. coli," in DNA Cloning 2: 
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Expression Systems, 2nd Edition, Glover et al. (eds.), page by chemicals such as gluteraldehyde (see, for example, 

93 (Oxford University Press 1995)). Sandhu, Crit. Rev. Biotech. 12:437, 1992). 

Briefly, monoclonal antibodies can be obtained by inject- The Fv fragments may comprise and V L chains which 

ing mice with a composition comprising a zpeplO gene are connected by a peptide linker. These single-chain antigen 

product, verifying the presence of antibody production by 5 binding proteins (scFv) are prepared by constructing a 

removing a serum sample, removing the spleen to obtain structural gene comprising DNAsequences encoding the V H 

B-lymphocytes, fusing the B-lymphocytes with myeloma and V L domains which are connected by an oligonucleotide, 

cells to produce hybridomas, cloning the hybridomas, select- The structural gene is inserted into an expression vector 

ing positive clones which produce antibodies to the antigen, which is subsequently introduced into a host cell, such as E. 

culturing the clones that produce antibodies to the antigen, io coli. The recombinant host cells synthesize a single polypep- 

and isolating the antibodies from the bybridoma cultures. tide chain with a linker peptide bridging the two V domains. 

In addition, an anti-zpeplO antibody of the present inven- Methods for producing scFvs are described, for example, by 

tion may be derived from a human monoclonal antibody. Whitlow el al., Methods: A Companion to Methods in 

Human monoclonal antibodies are obtained from transgenic Enzymology 2:97, 1991, also see, Bird et al., Science 

mice that have been engineered to produce specific human 15 242:423, 1988, Ladner et al., U.S. Pat. No. 4,946,778, Pack 

antibodies in response to antigenic challenge. In this et al., Bio/Technology 11:1271, 1993, and Sandhu, supra, 

technique, elements of the human heavy and light chain As an illustration, a scFV can be obtained by exposing 

locus are introduced into strains of mice derived from lymphocytes to zpeplO polypeptide in vitro, and selecting 

embryonic stem cell lines that contain targeted disruptions antibody display libraries in phage or similar vectors (for 

of the endogenous heavy chain and light chain loci. The 20 instance, through use of immobilized or labeled zpeplO 

transgenic mice can synthesize human antibodies specific protein or peptide). Genes encoding polypeptides having 

for human antigens, and the mice can be used to produce potential zpeplO polypeptide binding domains can be 

human antibody-secreting hybridomas. Methods for obtain- obtained by screening random peptide libraries displayed on 

ing human antibodies from transgenic mice are described, phage (phage display) or on bacteria, such as E. coli. 

for example, by Green et al., Nat. Genet. 7:13, 1994, 25 Nucleotide sequences encoding the polypeptides can be 

Lonberg et al., Nature 368:856, 1994, and Taylor et al., Int. obtained in a number of ways, such as through random 

Immun. 6:579, 1994. mutagenesis and random polynucleotide synthesis. These 

Monoclonal antibodies can be isolated and purified from random peptide display libraries can be used to screen for 

hybridoma cultures by a variety of well-established tech- peptides which interact with a known target which can be a 

niques. Such isolation techniques include affinity chroma- 30 protein or polypeptide, such as a ligand or receptor, a 

tography with Protein-A Sepharose, size-exclusion biological or synthetic macromolecule, or organic or inor- 

chromalography, and ion-exchange chromatography (see, ganic substances. Techniques for creating and screening 

for example, Coligan at pages 2.7.1-2.7.12 and pages such random pep tide display libraries are known in the art 

2.9.1-2.9.3; Baines et al., "Purification of Immunoglobulin (Ladner et al., U.S. Pat. No. 5,223,409, Ladner et al., U.S. 

G (IgG)," in Methods in Molecular Biology, Vol. 10, pages 35 Pat. No. 4,946,778, Ladner et al., U.S. Pat. No. 5,403,484, 

79-104 (The Humana Press, Inc. 1992)). Ladner et al., U.S. Pat. No. 5,571,698, and Kay et al., Phage 

For particular uses, it may be desirable to prepare frag- Display of Peptides and Proteins (Academic Press, Inc. 

ments of anti-zpeplO antibodies. Such antibody fragments 1996)) and random peptide display libraries and kits for 

can be obtained, for example, by proteolytic hydrolysis of screening such libraries are available commercially, for 

the antibody. Antibody fragments can be obtained by pepsin 40 instance from Clontech (Palo Alto, Calif.), Invitrogen Inc. 

or papain digestion of whole antibodies by conventional (San Diego, Calif.), New England Biolabs, Inc. (Beverly, 

methods. As an illustration, antibody fragments can be Mass.), and Pharmacia LKB Biotechnology Inc. 

produced by enzymatic cleavage of antibodies with pepsin (Piscataway, N.J.). Random peptide display libraries can be 

to provide a 5S fragment denoted F(ab') 2 . This fragment can screened using the zpeplO sequences disclosed herein to 

be further cleaved using a thiol reducing agent to produce 45 identify proteins which bind to zpeplO. 

3.5 S Fab' monovalent fragments. Optionally, the cleavage Another form of an antibody fragment is a peptide coding 

reaction can be performed using a blocking group for the for a single complementarity-determining region (CDR). 

sulfhydryl groups that result from cleavage of disulfide CDR peptides ("minimal recognition units") can be obtained 

linkages. As an alternative, an enzymatic cleavage using by constructing genes encoding the CDR of an antibody of 

pepsin produces two monovalent Fab fragments and an Fc 50 interest. Such genes are prepared, for example, by using the 

fragment directly. These methods are described, for polymerase chain reaction to synthesize the variable region 

example, by Goldenberg, U.S. Pat. No. 4,331,647, Nisonoff from RNA of antibody-producing cells (see, for example, 

et al., Arch Biochem. Biophys. 89:230, 1960, Porter, Bio- Larrick et al., Methods: A Companion to Methods in Enzy- 

chem. J. 73:119, 1959, Edelman et al., in Methods in mology 2:106, 1991), Courtenay-Luck, "Genetic Manipula- 

Enzymology Vol. 1, page 422 (Academic Press 1967), and by 55 tion of Monoclonal Antibodies," in Monoclonal Antibodies: 

Coligan, ibid. Production, Engineering and Clinical Application, Ritter et 

Other methods of cleaving antibodies, such as separation al. (eds.), page 166 (Cambridge University Press 1995), and 

of heavy chains to form monovalent light-heavy chain Ward et al., "Genetic Manipulation and Expression of 

fragments, further cleavage of fragments, or other Antibodies," in Monoclonal Antibodies: Principles and 

enzymatic, chemical or genetic techniques may also be used, 60 Applications, Birch et al., (eds.), page 137 (Wiley-Liss, Inc. 

so long as the fragments bind to the antigen that is recog- 1995)). 

nized by the intact antibody. Alternatively, an anti-zpeplO antibody may be derived 

For example, Fv fragments comprise an association of from a "humanized" monoclonal antibody. Humanized 

and V L chains. This association can be noncovalent, as monoclonal antibodies are produced by transferring mouse 

described by Inbar et al., Proc. Nat'l Acad. Sci. USA 65 complementary determining regions from heavy and light 

69:2659, 1972. Alternatively, the variable chains can be variable chains of the mouse immunoglobulin into a human 

linked by an intermolecular disulfide bond or cross-linked variable domain. Typical residues of human antibodies are 
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then substituted in the framework regions of the murine 
counterparts. The use of antibody components derived from 
humanized monoclonal antibodies obviates potential prob- 
lems associated with the immunogenicity of murine constant 
regions. General techniques for cloning murine immunoglo- 
bulin variable domains are described, for example, by 
Orlandi et al, Proc. Natl Acad. Sci. USA 86:3833, 1989. 
Techniques for producing humanized monoclonal antibodies 
are described, for example, by Jones et al., Nature 321:522, 
1986, Carter et al., Proc. Natl Acad. Set USA 89:4285, 
1992, Sandhu, Crit. Rev. Biotech. 12:437, 1992, Singer et al., 
J.Immun. 150:2544, 1993, Sudhir (td), Antibody Engineer- 
ing Protocols (Humana Press, Inc. 1995), Kelley, "Engi- 
neering Therapeutic Antibodies," in Protein Engineering: 
Principles and Practice, Cleland et al. (eds.), pages 399-434 
(John Wiley & Sons, Inc. 1996), and by Queen et al., U.S. 
Pat. No. 5,693,762 (1997). 

Polyclonal anti-idiotypc antibodies can be prepared by 
immunizing animals with anti-zpeplO antibodies or anti- 
body fragments, using standard techniques. See, for 
example, Green et al., "Production of Polyclonal Antisera /' 
in Methods In Molecular Biology: Immunochemical 
Protocols, Manson (ed.), pages 1-12 (Humana Press 1992). 
Also, see Coligan, ibid, at pages 2.4.1-2.4.7. Alternatively, 
monoclonal anti-idiotype antibodies can be prepared using 
anti-zpcplO antibodies or antibody fragments as immuno- 
gens with the techniques, described above. As another 
alternative, humanized anti-idiotype antibodies or subhu- 
man primate anti-idiotype antibodies can be prepared using 
the above-described techniques. Methods for producing 
anti-idiotype antibodies are described, for example, by Irie, 
U.S. Pat. No. 5,208,146, Greene, et. al, U.S. Pat. No. 
5,637,677, and Vanhakavi and Minocha, J. Gen. Virol. 
77:1875, 1996. 

Antibodies or polypeptides herein can also be directly or 
indirectly conjugated to drugs, toxins, radionuclides and the 
like, and these conjugates used for in vivo diagnostic or 
therapeutic applications. For instance, polypeptides or anti- 
bodies of the present invention can be used to identify or 
treat tissues or organs that express a corresponding anti- 
complementary molecule (receptor or antigen, respectively, 
for instance). More specifically, zpeplO polypeptides or 
anti-zpeplO antibodies, or bioactive fragments or portions 
thereof, can be coupled to detectable or cytotoxic molecules 
and delivered to a mammal having cells, tissues or organs 
that express the anti -complementary molecule. 

Suitable detectable molecules may be directly or indi- 
rectly attached to the polypeptide or antibody, and include 
radionuclides, enzymes, substrates, cof actors, inhibitors, 
fluorescent markers, cbemiluminescent markers, magnetic 
particles and the like. Suitable cytotoxic molecules may be 
directly or indirectly attached to the polypeptide or antibody, 
and include bacterial or plant toxins (for instance, diphtheria 
toxin, Pseudomonas exotoxin, ricin, abrin and the like), as 
well as therapeutic radionuclides, such as iodine-131, 
rhenium-188 or yttrium-90 (either directly attached to the 
polypeptide or antibody, or indirectly attached . through 
means of a chelating moiety, for instance). Polypeptides or 
antibodies may also be conjugated to cytotoxic drugs, such 
as adriamycin. For indirect attachment of a detectable or 
cytotoxic molecule, the detectable or cytotoxic molecule can 
be conjugated with a member of a complementary/ 
anticomplementary pair, where the other member is bound 
to the polypeptide or antibody portion. For these purposes, 
biotin/streptavidin is an exemplary complementary/ 
anticomplementary pair. 

Soluble zpeplO polypeptides or antibodies to zpeplO can 
be directly or indirectly conjugated to drugs, toxins, radio- 
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nuclides and the like, and these conjugates used for in vivo 
diagnostic or therapeutic applications. For instance, 
polypeptides or antibodies of the present invention can be 
used to identify or treat tissues or organs that express a 

5 corresponding anti-complementary molecule (receptor or 
antigen, respectively, for instance). More specifically, 
zpeplO polypeptides or anti-zpeplO antibodies, or bioactive 
fragments or portions thereof, can be coupled to detectable 
or cytotoxic molecules and delivered to a mammal having 

J0 cells, tissues or organs that express the anti-complementary 
molecule. 

Suitable detectable molecules can be directly or indirectly 
attached to the polypeptide or antibody, and include 
radionuclides, enzymes, substrates, cofactors, inhibitors, 
fluorescent markers, cbemiluminescent markers, magnetic 

15 particles and the like. Suitable cytotoxic molecules can be 
directly or indirectly attached to the polypeptide or antibody, 
and include bacterial or plant toxins (for instance, diphtheria 
toxin, Pseudomonas exotoxin, ricin, abrin and the like), as 
well as therapeutic radionuclides, such as iodine-131, 

20 rhenium-188 or yttrium-90 (either directly attached to the 
polypeptide or antibody, or indirectly attached through 
means of a chelating moiety, for instance). Polypeptides or 
antibodies can also be conjugated to cytotoxic drugs, such as 
adriamycin. For indirect attachment of a detectable or cyto- 

25 toxic molecule, the detectable or cytotoxic molecule can be 
conjugated with a member of a complementary/ 
anticomplementary pair, where the other member is bound 
to the polypeptide or antibody portion. For these purposes, 
biotin/streptavidin is an exemplary complementary/ 

30 anticomplementary pair. 

Such polypeptide-toxin fusion proteins or antibody/ 
fragment-toxin fusion proteins can be used for targeted cell 
or tissue inhibition or ablation (for instance, to treat cancer 
cells or tissues). Alternatively, if the polypeptide has mul- 

35 tiple functional domains (i.e., an activation domain or a 
ligand binding domain, plus a targeting domain), a fusion 
protein including only the targeting domain can be suitable 
for directing a detectable molecule, a cytotoxic molecule or 
a complementary molecule to a cell or tissue type of interest. 

40 In instances where the domain only fusion protein includes 
a complementary molecule, the anti-complementary mol- 
ecule can be conjugated to a detectable or cytotoxic mol- 
ecule. Such domain-complementary molecule fusion pro- 
teins thus represent a generic targeting vehicle for cell/ 

45 tissue-specific delivery of generic anti-complementary- 
detectable/cytotoxic molecule conjugates. The bioactive 
polypeptide or antibody conjugates described herein can be 
delivered intravenously, intraarterial^ or intraductally, or 
may be introduced locally at the intended site of action. 

50 The zpeplO gene is almost exclusively expressed in the 
testis. Low levels of transcript are also seen in a number of 
other tissues, with the kidney accounting for most of the 
ancillary expression. The tissue specificity observed for 
zpeplO suggests a general role in development and regula- 

55 tory control of testicular differentiation and gonadal ste- 
roidogenesis and spermatogenesis. ZpeplO polypeptides, 
agonists and antagonists have enormous potential in both in 
vitro and in vivo applications. 

Development of testicular hormone production can be 

60 divided into early and late steps, with the latter dependent on 
the activation of functionally-determined Leydig cell pre- 
cursors by LI I. However, the factors that control the early 
steps in this process remain unknown (Huhlaniemi, Reprod. 
Fertil. Dev. 7: 1025-35, 1995) suggesting that testis specific 

65 polypeptides such as zpeplO might be responsible for acti- 
vation of a non-steroidogenic, non-LH responsive precursor 
cell. 
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Once Leydig cell differentiation has occurred, production incorporation of 5-bromo-2'-deoxyuridine (BrdU) in the 

of steroid hormones in the testis is dependent on the secre- DNA of proliferating cells (Porstmann et al., J. Immunol. 

tion of the gonadotropins, LH and FSH, by the pituitary. Methods 82:169-79, 1985), and use of tetrazolium salts 

LH stimulates production of testosterone by the Leydig (Mosmann, J. Immunol. Methods 65:55-63, 1983; Alley et 

cells, whereas spermatogenesis depends on both FSH and 5 al., Cancer Res. 48:589-601, 1988; Marshall et al., Growth 

high intra testicular testosterone concentrations. LH and FSH Reg. 5:69-84, 1995; and Scudiero et al., Cancer Res. 

secretion is in turn under control of gonadotrophin releasing 48:4827-33, 1988) and by measuring proliferation using 

hormone (GnRH) produced in the hypothalamus (Kaufman, 3 H-thymidine uptake (Crowley et al., J. Immunol. Meth. 

The neuro endocrine regulation of male reproduction, in: 133:55-66, 1990). Assays measuring differentiation include, 

Male Infertility. Clinical Investigation, Cause Evaluation io for example, measuring cell-surface markers associated with 

and Treatment., F H Comhaire, ed., Chapman and Hall, stage-specific expression of a tissue, enzymatic activity, 

London, pp 29-54, 1996). Since testicular products have functional activity or morphological changes (Watt, FASEB, 

been shown to control LH and FSH production and in turn, 5:281^4, 1991; Francis, Differentiation 57:63-75, 1994; 

these products regulate, testicular function, this suggests a Raes, Adv. Anim. Cell Biol. Technol Bioprocesses, 161-71, 

regulatory role for zpeplO in hormone production by the 15 1989). 

hypothalamic, pituitary, gonadal axis. ZpeplO polypeptides, agonists and antagonists will also 

It is well known that steroidogenesis and spermatogenesis prove useful in the study of spermatogenesis and infertility, 

take place within two different cellular compartments of the In vivo, zpeplO agonists may find application in the treat- 

testes, with Leydig and Sertoli cells responsible for the ment of male infertility. ZpeplO antagonists may be useful 

former and latter, respectively (Saez, Endocrin. Rev. 15: 20 as male contraceptive agents. ZpeplO antagonists are useful 

574-626, 1994). The activity of each of these cell types as research reagents for characterizing sites of ligand- 

appears to be regulated by the secretory products of the receptor interaction. 

other. Sertoli cell derived tumor necrosis factor-a, fibroblast In vivo assays, well known in the art, are available for 

growth factor, interleukin-1 transforming growth factor-B, evaluating the effect of zpeplO ligands and agonists on 

epidermal growth factor/transforming growth factor-a, 25 testes. For example, compounds can be injected intraperi- 

activin, inhibin, insulin-like growth factor- 1, platelet derived toneally for a specific time duration. After the treatment 

growth factor, endothelin, and ariginine-vasopressin have all period, animals are sacrificed and testes removed and 

been shown to regulate Leydig cell function (Saez, Endo- weighed. Testicles are homogenized and sperm head counts 

crin. Rev. 15: 574-626, 1994). Thus, zpeplO might control are made (Meistrich et al., Exp. Cell Res. 99:72-8, 1976). 

or modulate the activities of one or more of these genes. 30 Other activities, for example, chemotaxic activity that may 

The membrane glycoprotein zpeplO may also function as be associated with proteins of the present invention can be 

a binding site for one or more growth factor peptides or analyzed. For example, late stage factors in spermatogenesis 

hormones in much the same way that heparin binds with are involved in egg-sperm interactions and sperm motility, 

platelet-derived growth factor (PDGF), fibroblast growth Activities, such as enhancing viability of cryopreserved 

factors (such as aFGF and bFGF) and vascular endothelial 35 sperm, stimulating the acrosome reaction, enhancing sperm 

growth factor (VFGF) and sequesters them on the cell motility and enhancing egg-sperm interactions may be asso- 

surface. dated with the ligands and agonists of the present invention. 

In men, aging is associated with a progressive decline in Assays evaluating such activities are known (Rosenberger, 

testicular function. These changes are manifest clinically by /. Androl. 11:89-96, 1990; Fuchs, Zentralbl. GynakoL 

decreased virility, vigor, and libido that point towards a 40 11:117-120, 1993; Neurwingeret &l.,Andrologia 22:335-9, 

relative testicular deficiency (Vermeulen, Ann. Med. 1990; Harris et al., Human Reprod. 3:856-60, 1988; and 

25:531^, 1993; Pugeat et al., Horm. Res. 43: 104-10, Jockcnhovcl, Andrologia 22:171-178, 1990; Lessing et al., 

1995). Hormone replacement therapy in elderly men is not Fertil. Steril. 44:406-9, 1985; Zaneveld, In Male Infertility 

currently recommended which suggests that a new therapy Chapter 11, Comhaire Ed., Chapman & Hall, London 1996). 

for the male climacterium would be very valuable. ZpeplO 45 These activities are expected to result in enhanced fertility 

polypeptides, agonists or antagonists, either independently and successful reproduction. 

or in combination with other factors, may be evaluated Localization of zpeplO to testis tissue suggests zpeplO, its 

therapeutically. agonists and/or antagonists may have applications in 

Soluble zpeplO polypeptides, zpeplO agonists and/or enhancing fertilization during assisted reproduction in 

zpeplO antagonists may also have therapeutic value in so humans and in animals. Such assisted reproduction methods 

treatment of testicular cancer, infertility, or in the recovery are known in the art and include artificial insemination, in 

of function following testicular surgery. vitro fertilization, embryo transfer and gamete intrafallopian 

The ability of zpeplO polypeptides and zpeplO agonists to transfer. Such methods are useful for assisting men and 

stimulate prolife ration or differentiation of testicular cells women who may have physiological or metabolic disorders 

can be measured using cultured testicular cells or in vivo by 55 that prevent natural conception. Such methods are also used 

administering molecules of the present invention to the in animal breeding programs, such as for livestock, zoologi- 

appropriate animal model. Cultured testicular cells include cal animals, endangered species or racehorses and could be 

dolphin DBl.Tes cells (CRL-6258); mouse GC-1 spg cells used as methods for the creation of transgenic animals. 

(CRL-2053); TM3 cells (CRL-1714); TM4 cells (CRL- To verify the presence of this capability in zpeplO 

1715); and pig ST cells (CRL-1746), available from Ameri- 60 polypeptides, agonists or antagonists of the present 

can Type Culture Collection, 10801 University Boulevard, invention, such molecules are evaluated with respect to their 

Manassas, Va. Assays measuring cell proliferation or differ- ability to enhance viability of cryopreserved sperm, sperm 

entiation are well known in the art. For example, assays motility, the ability of sperm to penetrate cervical mucus, 

measuring proliferation include such assays as chemosensi- particularly in association with methods of assisted 

tivity to neutral red dye (Cavanaugh et al., Investigational 65 reproduction, according to procedures known in the art (see 

New Drugs 8:347-354, 1990, incorporation of radiolabeled for example, Juang et al, Anim. Reprod. Sci. 20:21-9, 1989; 

nucleotides (Cook et al., Anal. Biochem. 179:1-7, 1989), Juang et H. f Anim. Reprod. Sci. 22:47-53, 1990; Colon etal., 
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Fertil Steril 46:1133-39, 1986; hts&ingtl al., Fertil. SteriL use. The term "pharmaceutically acceptable carrier or 

44:406-9, 1985 and Brenner el al., Fertil Steril. 42:92-6, vehicle" refers to a carrier medium which does not interfere 

1984). If desired, zpeplO polypeptide performance in this with the effectiveness of the biological activity of the active 

regard can be compared to relaxins and the like. In addition, ingredients and which is not toxic to the host or patient. One 

zpeplO polypeptides or agonists or antagonists thereof may 5 skilled in the art may formulate the compounds of the 

be evaluated in combination with one or more proteins to present invention in an appropriate manner, and in accor- 

identify synergistic effects. For example, soluble zpeplO, dance wth accepted practices, such as those disclosed in 

agonists and/or antagonists can be added to "capacitation Remington: The Science and Practice of Pharmacy, 

media", a cocktail of compounds known to activate sperm, fermaro, ed " Mack Publishing Co., Easton, Pa., 19th ed., 

such as caffeine, dibutyl cyclic adenosine monophosphate 10 . * , , . . „ a _ c 

4Wm i_i.M-e.i_- u | * . As used herein, a pharmaceutically effective amount of a 

(dbcAMP) or uieophylhne. Such mixtures have resulted in Q p^p^f agon ^ or ar / agonist , * an amount 

improved reproductive funcUon of the sperm, in particular, suffideDt " gg^ a desired biol ical result . The result can 

sperm motility and zonae penetration (Park et al., Am J be alleviation ofthe signs, symptoms, or causes of a disease, 

Obstet. Gynecol. 158:974-9, 1988; Vandevoort et al., Mol. or othcr desired alleration of a biological system. For 

Repro. Develop. 37:299-304, 1993; Vandevoort and IS example, an effective amount of a polypeptide ofthe present 

Overstreet, J. Androl. 16:327-33, 1995). The capacitation invention is that which provides either subjective relief of 

mixture can then be combined with sperm, an egg or an symptoms or an objectively identifiable improvement as 

egg-sperm mixture prior to fertilization of the egg. noted by the clinician or other qualified observer. Doses of 

In cases where pregnancy is not desired, zpeplO polypeptide will generally be determined by the 
zpeplOpolypeptides or polypeptide fragments may function 20 clinician according to accepted standards, taking into 
as germ-cell-specific antigens for use as components in account the nature and severity of the condition to be treated, 
"immunoconlraceptive" or "anti-fertility" vaccines to patient traits, etc. Determination of dose is within the level 
induce formation of antibodies and/or cell mediated immu- of ordinary skill in the art. The proteins may be administered 
nity to selectively inhibit a process, or processes, critical to for acute treatment, over one week or less, often over a 
successful reproduction in humans and animals. The use of 25 period of one to three days or may be used in chronic 
sperm and testis antigens in the development of an immu- treatment, over several months or years, 
nocontraceptive have been described (OTIern et al., Biol Radiation hybrid mapping is a somatic cell genetic tech- 
Reprod. 52:311-39, 1995; Diekman and Heir, Am. J. nique developed for constructing high-resolution, conligu- 
Reprod. Immunol. 37:111-17, 1997; Zhu and Naz, Proc. ous maps of mammalian chromosomes (Cox et al., Science 
Natl. Acad. Sci. USA 94:4704-9,1997). A vaccine based on 30 250:245-50, 1990). Partial or full knowledge of a gene's 
human chorionic gonadotropbin (HCG) linked to a diphthe- sequence allows the designing of PCR primers suitable for 
ria or tetanus carrier is currently in clinical trials (Talwar et use with chromosomal radiation hybrid mapping panels. 
ai.,Proc. Natl. Acad Sci. USA 91:8532-36, 1994). A single Radiation hybrid mapping panels are commercially avail- 
injection resulted in production of high titer antibodies that able which cover the entire human genome, such as the 
persisted for nearly a year in rabbits (Stevens, Am. J. 35 Stanford G3 RH Panel and the GeneBridge 4 RH Panel 
Reprod Immunol. 29:176--88, 1993). Such methods of (Research Genetics, Inc., Huntsville, Ala.). These panels 
immunocontraception using vaccines would include a enable rapid, PCR based, chromosomal localizations and 
zpeplO testes-specific protein or fragment thereof. The ordering of genes, sequence-tagged sites (STSs), and other 
zpeplO protein or fragments can be conjugated to a carrier nonpolymorphic and polymorphic markers within a region 
protein or peptide, such as tetanus or diphtheria toxoid. An 40 of interest. This includes establishing directly proportional 
adjuvant, as described above, can be included and the physical distances between newly discovered genes of inter- 
protein or fragment can be noncovalently associated with est and previously mapped markers. The precise knowledge 
other molecules to enhance intrinsic immunoreactivity. of a gene's position can be useful in a number of ways 
Methods for administration and methods for determining the including: 1) determining if a sequence is part of an existing 
number of administrations are known in the art. Such a 45 contig and obtaining additional surrounding genetic 
method might include a number of primary injections over sequences in various forms such as YAC-, BAC- or cDNA 
several weeks followed by booster injections as needed to clones, 2) providing a possible candidate gene for an inher- 
maintain a suitable antibody titer. itable disease which shows linkage to the same chromo- 

For pharmaceutical use, pharmaceutically effective somal region, and 3) for cross-referencing model organisms 

amounts of zpeplO therapeutic antibodies, small molecule 50 such as mouse which may be beneficial in helping to 

antagonists or agonists of zpeplO polypeptides, or zpeplO determine what function a particular gene might have, 

polypeptide fragments or soluble zpeplO receptors can be The present invention provides reagents for use in diag- 

formulated with pharmaceutically acceptable carriers for nostic applications. For example, the zpeplO gene, a probe 

parenteral, oral, nasal, rectal, topical, transdermal adminis- comprising zpeplO DNAor RNA, or a subsequence thereof 

nation or the like, according to conventional methods. 55 can be used to determine if the zpeplO gene is present on a 

Formulations may further include one or more diluents, particular chromosome or if a mutation has occurred, 

fillers, emulsifiers, preservatives, buffers, excipients, and the Detectable chromosomal aberrations at the zpeplO gene 

like, and may be provided in such forms as liquids, powders, locus include, but are not limited to, aneuploidy, gene copy 

emulsions, suppositories, liposomes, transdermal patches number changes, insertions, deletions, restriction site 

and tablets, for example. Slow or extended-release delivery 60 changes and rearrangements. These aberrations can occur 

systems, including any of a number of biopolymers within the coding sequence, within introns, or within flank- 

(biological-based systems), systems employing liposomes, ing sequences, including upstream promoter and regulatory 

and polymeric delivery systems, can also be utilized with the regions, and may be manifested as physical alterations 

compositions described herein to provide a continuous or within a coding sequence or changes in gene expression 

long-term source of the zpeplO polypeptide, agonist or 65 level. 

antagonist. Such slow release systems are applicable to In general, these diagnostic methods comprise the steps of 

formulations, for example, for oral, topical and parenteral (a) obtaining a genetic sample from a patient; (b) incubating 
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the genetic sample with a polynucleotide probe or primer as encoding a marker (Feigner et al., Proc. Natl. Acad. Set USA 

disclosed above, under conditions wherein the polynucle- 84:7413-7, 1987; Mackey et H.,Proc. Natl Acad Sci. USA 

otidc will hybridize to complementary polynucleotide 85:8027-31, 1988). The use of lipofection to introduce 

sequence, to produce a first reaction product; and (iii) exogenous genes into specific organs in vivo has certain 

comparing the first reaction product to a control reaction s practical advantages. Molecular targeting of liposomes to 

product. A difference between the first reaction product and specific cells represents one area of benefit. More 

the control reaction product is indicative of a genetic abnor- particularly, directing transfection to particular cells repre- 

mality in the patient. Genetic samples for use within the sents one area of benefit. For instance, directing transfection 

present invention include genomic DNA, cDNA, and RNA to particular cell types would be particularly advantageous 

The polynucleotide probe or primer can be RNA or DNA, to in a tissue with cellular heterogeneity, such as the pancreas, 

and will comprise a portion of SEQ ID NO:l, the comple- liver, kidney, and brain. Lipids may be chemically coupled 

ment of SEQ ID NO:l, or an RNA equivalent thereof. to other molecules for the purpose of targeting. Targeted 

Suitable assay methods in this regard include molecular peptides (e.g., hormones or neurotransmitters), proteins such 

genetic techniques known to those in the art, such as as antibodies, or non-peptide molecules can be coupled to 

restriction fragment length polymorphism (RFLP) analysis, 15 liposomes chemically. 

short tandem repeat (STR) analysis employing PCR It is possible to remove the target cells from the body; to 

techniques, ligation chain reaction (Barany, PCR Methods introduce the vector as a naked DNA plasmid; and then to 

and Applications 1:5-16, 1991), ribonuclease protection re-implant the transformed cells into the body. Naked DNA 

assays, and other genetic linkage analysis techniques known vectors for gene therapy can be introduced into the desired 

in the art (Sambrook et al., ibid.; Ausubel et. al., ibid.; 20 host cells by methods known in the art, e.g., transfection, 

Marian, Chest 108:255-65, 1995). Ribonuclease protection electroporation, microinjection, transduction, cell fusion, 

assays (see, e.g., Ausubel el al., ibid., ch. 4) comprise the DEAE dextran, calcium phosphate precipitation, use of a 

hybridization of an RNA probe to a patient RNA sample, gene gun or use of a DNA vector transporter. See, e.g., Wu 

after which the reaction product (RNA-RNA hybrid) is et al., J. BioL Chem. 267:963-7, 1992; Wu et al., J. Biol. 

exposed to RNase. Hybridized regions of the RNA are 25 Chem. 263:14621-4, 1988. 

protected from digestion. Within PCR assays, a patient's Antisense methodology can be used to inhibit zpeplO 

genetic sample is incubated with a pair of polynucleotide gene translation, such as to inhibit cell proliferation in vivo, 

primers, and the region between the primers is amplified and Polynucleotides that are complementary to a segment of a 

recovered. Changes in size or amount of recovered product zpeplO-encoding polynucleotide (e.g., a polynucleotide as 

are indicative of mutations in the patient. Another PCR- 30 set froth in SEQ ID NO:l) are designed to bind to zpeplO- 

based technique that can be employed is single strand encoding mRNA and to inhibit translation of such mRNA. 

conformational polymorphism (SSCP) analysis (Hayashi, Such antisense polynucleotides are used to inhibit expres- 

PCR Methods and Applications 1:34-8, 1991). sion of zpeplO polypeptide-encoding genes in cell culture or 

Polynucleotides encoding zpeplO polypeptides are useful in a subject, 

within gene therapy applications where it is desired to 35 Transgenic mice, engineered to express the zpeplO gene, 

increase or inhibit zpeplO activity. If a mammal has a and mice that exhibit a complete absence of zpeplO gene 

mutated or absent zpeplO gene, the zpeplO gene can be function, referred to as "knockout mice" (Snouwaert et al., 

introduced into the cells of the mammal. In one Science 257:1083, 1992), may also be generated (Lowell et 

embodiment, a gene encoding a zpeplO polypeptide is al., Nature 366:740-42, 1993). These mice may be 

introduced in vivo in a viral vector. Such vectors include an 40 employed to study the zpeplO gene and the protein encoded 

attenuated or defective DNA virus, such as, but not limited thereby in an in vivo system. 

to, herpes simplex virus (HSV), papillomavirus, Epstein The invention is further illustrated by the following 

Barr virus (EBV), adenovirus, adeno-associated virus non-limiting examples. 
(AAV), and the like. Defective viruses, which entirely or 

almost entirely lack viral genes, are preferred. A defective 45 EXAMPLES 
virus is not infective after introduction into a cell. Use of 

defective viral vectors allows for administration to cells in a Example 1 
specific, localized area, without concern that the vector can 
infect other cells. Examples of particular vectors include, but 
are not limited to, a defective herpes simplex virus 1 (HSV1) 50 The zpeplO polypeptide-encoding polynucleotides of the 
vector (Kaplitt et al., Molec. Cell. Neurosci. 2:320-30, present invention were initially identified by querying an 
1991); an attenuated adenovirus vector, such as the vector EST database for polypeptides containing repetitive patterns 
described by Stratford-Perricaudet et al., J. Clin. Invest. and post-translational processing sites yielding potentially 
90:626-30, 1992; and a defective adeno-associated virus active peptides. The polypeptide encoded by an EST meet- 
vector (Samulski et al., J. Virol. 61:3096-101, 1987; Sam- 55 ing those search criteria was further analyzed and found to 
ulski et al., J. Virol. 63:3822-8, 1989). be a membrane glycoprotein. The EST sequence was from 
In another embodiment, a zpeplO gene can be introduced a testis cell library. Several clones considered likely to 
in a retroviral vector, e.g., as described in Anderson et al., contain the entire coding region were used for sequencing 
U.S. Pat. No. 5,399,346; Mann et al. Cell 33:153, 1983; and resulted in incompletely spliced messages. A minimal 
Temin et al., U.S. Pat. No. 4,650,764; Temin et al., U.S. Pat. 60 nucleotide sequence having all potential introns spliced out 
No. 4,980,289; Markowitz et al., J. Virol. 62:1120, 1988; was generated from these sequences. 
Temin et al., U.S. Pat. No. 5,124,263; International Patent To obtain the complete cDNA sequence a human testis 
Publication No. WO 95/07358, published Mar. 16, 1995 by library was screened. The library was plated in pools of 
Dougherty et al.; and Kuo et al., Blood 82:845, 1993. 12,000. Plasmid DNA was prepared from the plated bacteria 
Alternatively, the vector can be introduced by lipofection in 65 using a Qiagen® plasmid purification column (Qiagen, Inc., 
vivo using liposomes. Synthetic cationic lipids can be used Chatsworth, Calif.) according to the manufacturer's instruc- 
lo prepare liposomes for in vivo transfection of a gene tions. DNA from these pools were used as template DNA to 
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identify pools containing the DNA encoding zpeplO using 
PCR. Oligonucleotide primers ZC16.186, (SEQ ID NO:9) 
and ZC16.187, (SEQ ID NO: 10) were designed from the 
sequence of the EST. One nanogram of template DNA was 
combined with 20 pmoles of each primer in a PCR mixture. 
The reaction mixture was incubated at 94° C. for 5 minutes, 
then run for 35 cycles of 94° C, 30 seconds and 68° C, 30 
seconds; followed by an extension at 68* C. for 7 minutes. 
Pools having the correct sized PCR produce, 290 bp, were 
used as a template for PCR isolation of the 5' end of the 
clones. Sequence specific primer ZC16,186 (SEQ ID NO:9) 
and vector specific primer ZC13.006 (SEQ ID NO: 11) were 
used in PCR reactions as above. PCR products were purified 
by Qiaex II Gel Extraction Kit (Qiagen, Inc.) according to 
manufacturer's instructions and sequenced. Pools which 
contained the clones with the most fully spliced sequence 
were used to transform E. coli and plated to agar. The 
colonies were transferred to nitrocellulose and probed with 
a 290 bp fragment derived above. The probe was radioac- 
tively labeled using the MULTI PRIME DNA labeling kit 
(Amersham, Arlington Heights, 111.) according to the manu- 
facturer's instructions. The probe was purified using a 
NUCTRAP push column (Stratagene). ExpressHyb 
(Clontech) solution was used for prehybridization and as a 
hybridizing solution for the colony lifts. Hybridization took 
place at 65° C. for over 12 hours using 1.2x10° cpm/ml of 
labeled probe. The filters were then washed 4 times at 5 
minutes each in 2x SSC, 0.005% SDS at 25° C. followed by 
2 washes at 20 minutes each in 0.1 x SSC, 0.1% SDS at 50° 
C. with continuous agitation. Plasmid DNA from those 
colonies producing a signal were isolated and sequenced. 
The 1094 bp (SEQ ID NO:l) sequence encoding the a full 
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length zpeplOpolypeptide was isolated. An intron may be 
contained within the 3' untranslated region from base pairs 
560-784 of SEQ ID NO:l. 

5 Example 2 

Tissue Distribution 

Human Multiple Tissue Northern Blots (MTN I, MTN II 
and MTN III; Clontech) were probed to determine the tissue 
distribution of human zpeplO expression. An approximately 
530 bp probe, entirely 3' UTR was derived by restriction 
digest of the clone described above with Not I and Eco RI. 
The restriction digested fragment was visualized by agarose 
gel electrophoresis and purified using Qiaex II (Qiagen, 
Chatsworth, Calif.) according to manufacturer's instruc- 
tions. The probe was radioactively labeled using the MUL- 
TIPRIME DNA labeling kit (Amersham, Arlington Heights, 

20 111.) according to the manufacturer's instructions. The probe 
was purified using a NUCTRAP push column (Stratagene) 
EXPRESSHYB (Clontech) solution was used for prehybrid- 
ization and as a hybridizing solution for the Northern blots. 
Hybridization took place overnight at 65° C. using and the 

25 blots were then washed at 50° C. in lx SSC, 0.1% SDS. A 
1.5 kb transcript corresponding to zpeplO was seen in testis 
and a non-discrete smear was seen in kidney. 

A RNA Master Dot Blot (Clontech) that contained RNAs 
from various tissues that were normalized to 8 housekeeping 
30 genes was also probed and hybridized as described above. 
The highest level of expression was seen in testis with 
significantly reduced expression in kidney. 



SEQUENCE LISTING 



<160> NUMBER OF SEQ ID NOS : 14 

<210> SEQ ID NO 1 

<211> LENGTH: 1094 

<212> TYPE: DNA 

<:213> ORGANISM: Homo sapiens 

<220> FEATURE: 

<221> NAME/KEY: CDS 

<222> LOCATION: (79). ..(504) 

<400> SEQUENCE: 1 

ggcaagggct ggagccaggg ctgeagagea ttccttggct cagctggggc agcgccgccc 60 

catcccccag tggtcctc atg tgg agg ctg gca eta ggc ggg gtt ttc ctg 111 
Met Trp Arg Leu Ala Leu Gly Gly Val Phe Leu 
1 5 10 

gca gec gec cag get tgt gtc ttc tgt cgc etc cca gec cac gac ttg 159 
Ala Ala Ala Gin Ala Cys Val Phe Cys Arg Leu Pro Ala His Asp Leu 
15 20 25 

tea ggc cgc ctg get egg etc tgc age cag atg gag gee agg cag aag 207 
Ser Gly Arg Leu Ala Arg Leu Cys Ser Gin Met Glu Ala Arg Gin Lys 
30 35 40 

gaa tgt ggg gec tec cca gac ttc teg gec ttt gec tta gat gag gtg 25 S 

Glu Cys Gly Ala Ser Pro Asp Phe Ser Ala Phe Ala Leu Asp Glu Val 
45 50 55 

tec atg aac aaa gtc aca gag aag act cac aga gtc ctg agg gtc atg 303 
Ser Met Asn Lys Val Thr Glu Lys Thr His Arg Val Leu Arg Val Met 
60 65 70 75 



ggg ggc age ace acg ctg tac aac tgc tec acc tgc aag ggg acg gag 



351 
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Gly Gly Ser Thr Thr Leu Tyr Asn Cys Ser Thr Cys Lys Gly Thr Glu 
80 85 90 

gtg tec tgc tgg ccc cga aag cgc tgc ttc cca gga agt cag gat ctt 
Val Ser Cys Trp Pro Arg Lys Arg Cys Phe Pro Gly Ser Gin Asp Leu 
95 100 105 

tgg gaa gec aag att ctg etc etc tec ate ttc gga get ttc ctg ctt 
Trp Glu Ala Lye lie Leu Leu Leu Ser lie Phe Gly Ala Phe Leu Leu 
110 115 120 

ctg ggt gtt ctg age etc ctg gtg gag tec cac cac etc caa gca aaa 
Leu Gly Val Leu Ser Leu Leu Val Glu Ser Hie His Leu Gin Ala Lys 
125 130 135 

agt ggc ttg tgaagacget gaaaacctcc cagcctccag ctctaagggg 

Ser Gly Leu 

140 



tatgeactea 


caacttccac 


atcccttgga 


ggggaaccag tcagcccctt agtcccagct 


604 


ccaaagacag 


tctccagacc 


ctaaaaccca 


gaeatecctg cttctggttg gtgagataat 


664 


gaaaaacaag 


aaaatcccca 


aaaacccaga tcccccacaa tcccagtgtc agatggcetc 


724 


ccgggaaccc 


aggcacccac 


agctggaaag 


ttcctcccct ccagccctca accaatcaca 


784 


tggctgtcaa 


caatgecagg 


aaaatatcta 


cagaaggaaa gaatccccta cgccactccc 


844 




cacccccttc 


tgcctgttcc 


gggaaagegg gggcatctgc cccagaagct 


904 


attccaggcc 


ctcctatgac 


tgatggggaa teegggaatg catgttctgg aaaactcacc 


964 


ccactagagt 


gagatcacat 


cagtgggttc 


gcgggcatgc cctccctcca tcgtgttaac 


1024 


agtttgaaat 


cctggcctcc 


ctcagaggcc 


tccatcctgc caggectaag taaaacttgc 


1084 


tgttcatgga 








1094 



<210> SEQ ID NO 2 

<211> LENGTH: 142 

<212> TYPE: PRT 

<213> ORGANISM : Homo sapiens 

<400> SEQUENCE: 2 

Met Trp Arg Leu Ala Leu Gly Gly Val Phe Leu Ala Ala Ala Gin Ala 



Cys Val Phe Cys Arg Leu Pro Ala His Asp Leu Ser Gly Arg Leu Ala 
20 25 30 

Arg Leu Cye Ser Gin Met Glu Ala Arg Gin Lys Glu Cys Gly Ala Ser 
35 40 45 

Pro Asp Phe Ser Ala Phe Ala Leu Asp Glu Val Ser Met Asn Lys Val 
50 55 60 

Thr Glu Lys Thr His Arg Val Leu Arg Val Met Gly Gly Ser Thr Thr 
65 70 75 80 

Leu Tyr Asn Cys Ser Thr Cys Lys Gly Thr Glu Val Ser Cys Trp Pro 
85 90 95 

Arg Lys Arg Cys Phe Pro Gly Ser Gin Asp Leu Trp Glu Ala Lys He 
100 105 110 

Leu Leu Leu Ser He Phe Gly Ala Phe Leu Leu Leu Gly Val Leu Ser 
115 120 125 

Leu Leu Val Glu Ser His His Leu Gin Ala Lys Ser Gly Leu 
130 135 140 



<210> SEQ ID NO 3 
<211> LENGTH: 426 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
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<220> FEATURE: 

<223> OTHER INFORMATION: Degenerate nucleotide aequence encoding the 

zpeplO polypeptide of SEQ ID NO: 2 
<221> NAME/KEY: variation 
<222> LOCATION: (1)...(426) 

<223> OTHER INFORMATION: Each N is independently any nucleotide. 
<400> SEQUENCE: 3 



atgtggmgny 


tngcnytngg 


nggngtntty 


ytngcngcng 


cncargcntg 


ygtnttytgy 


60 


mgnytnccng 


cncaygayyt 


nwsnggnragn 


ytngcnmgny 


tntgywsnca 


ratggargcn 


120 


mgncaraarg 


artgyggngc 


nwsnccngay 


ttywsngcnt 


tygcnytnga 


ygargtnven 


180 


atgaayaarg 


tnacngaraa 


racncaymgn 


gtnytnmgng 


tnatgggngg 


nwsnacnaen 


240 


ytntayaayt 


gyvrsnacntg 


yaarggnacn 


gargtnwsnt 


gytggccnmg 


naarmgntgy 


300 


ttyccnggnw 


encargayyt 


ntgggargcn 


aarathytny 


tnytnwanat 


httyggngen 


360 


ttyytnytny 


tnggngtnyt 


nwsnytnytn 


gtngarwsnc 


aycayytnca 


rgcnaarwsn 


420 


ggnytn 












426 



<210> SEQ ID NO 4 
<211> LENGTH: 17 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Degenerate oligonucleotide probe 
<221> NAME /KEY: variation 
<222> LOCATION: (1)...(17) 

<223> OTHER INFORMATION : Each N is independently any nucleotide 
<400> SEQUENCE: 4 

cargentgyg tnttytg 17 



<210> SEQ ID NO 5 
<211> LENGTH: 17 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Degenerate oligonucleotide probe 
<221> NAME /KEY: variation 
<222> LOCATION: (1)...<17) 

<223> OTHER INFORMATION: Each N is independently any nucleotide. 
<400> SEQUENCE: 5 

caraargart gyggngc 17 



<210> SEQ ID NO 6 

<211> LENGTH: 17 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: Degenerate oligonucleotide probe 

<221> NAME/KEY: variation 

<222> LOCATION: (1)...(17) 

<223> OTHER INFORMATION: Each N is independently any nucleotide. 

<400> SEQUENCE: 6 

atgaayaarg rnacnga 17 



<210> SEQ ID NO 7 

<2U> LENGTH: 17 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: Degenerate oligonucleotide probe 

<221> NAME/KEY: variation 

<222> LOCATION: (!)...( 17) 
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<223> OTHER INFORMATION: Each N is independently any nucleotide. 
<4 00> SEQUENCE: 7 

grnacngara aracnca 17 

<210> SEQ ID NO 8 
<211> LENGTH: 17 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Degenerate oligonucleotide probe 
<221> NAME/KEY: variation 
<222> LOCATION: (1)...(17) 

<223> OTHER INFORMATION: Each N is independently any nucleotide. 
<400> SEQUENCE: B 

acntgyaarg gnacnga 17 



<210> SEQ ID NO 9 
<211> LENGTH: 25 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION : Oligonucleotide ZC16,186 
<4 00> SEQUENCE: 9 

atcagtcata ggagggcctg gaata 25 



<210> SEQ ID NO 10 

<2U> LENGTH: 25 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: Oligonucleotide ZC16,187 

<400> SEQUENCE: 10 

tccctgcttc tggttggtga gataa 25 



<210> SEQ ID NO 11 
<2H> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Oligonucleotide ZC 13,006 
<4 00> SEQUENCE: 11 

ggctgtcctc taagcgtcac 20 



<210> SEQ ID NO 12 

<211> LENGTH: 12 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION : Nucleotide contig example 

<4 00> SEQUENCE: 12 

atggcttagc tt 12 



<210> SEQ ID NO 13 
<211> LENGTH : 12 
<212> TYPE : DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Nucleotide contig example 



<400> SEQUENCE: 13 
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tagcttgagt ct 



<210> SEQ ID NO 14 
<211> LENGTH: 12 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Nucleotide contig example 



<4 00> SEQUENCE: 14 
gtcgactacc ga 



What is claimed is: that specifically binds with a polypeptide having the amino 

1. An isolated polypeptide comprising an extracellular acid sequence of SEQ ID N0:2. 

domain, wherein said extracellular domain comprises amino 8. An isolated polypeptide according to claim 7, wherein 

acid residues 22 to 111 of the amino acid sequence of SEQ any difference between said amino acid sequence of said 

ID NO:2. isolated polypeptide and said corresponding amino acid 

2. An isolated polypeptide according to claim 1, wherein sequence of SEQ ID NO:2 is due to a conservative amino 
said polypeptide further comprises a transmembrane domain acid substitution. 

that resides in a carboxyl-terminal position relative to said 9. An isolated polypeptide of claim 7, wherein the amino 
extracellular domain, wherein said transmembrane domain ^ acid percent identity is determined using a FASTA program 

comprises amino acid residues 112 to 133 of the amino acid with ktup=l, gap opening penalty=10, gap extension 

sequence of SEQ ID NO:2. penalty=l, and substitution matrix-blosumG2, with other 

3. An isolated polypeptide according to claim 2, wherein parameters set as default. 

said polypeptide further comprises a cytoplasmic domain 10. An isolated polypeptide comprising the amino acid 
that resides in a carboxyl-terminal position relative to said 3Q sequence of amino acid residue 1 to amino acid residue 20 

transmembrane domain, wherein said cytoplasmic domain of SEQ ID N0:2. 

comprises amino acid residues 134 to 142 of the amino acid 11. An isolated polypeptide selected from the group 

sequence of SEQ ID NO:2. consisting of: 

4. An isolated polypeptide according to claim 2, wherein a ) amino acid residues 21-111 of SEQ ID NO:2; 
said polypeptide further comprises a secretory signal that ^ b) amifl0 acid residues 112 _ 133 of SEQ , D N0;2; 
resides in an amino-terminal position relative to said extra- c) amiflo ^ residues 134 _ H2 Qf SEQ {D NQ;2 
cellular domain, wherein said secretory signal sequence ' . . , . . 1 _„ . ir . kr _ _ 
comprises amino acid residues 1 to 20 of the amino acid d ammo acid rcsiducs ]~ 20 of ID N0:2 ' 
sequence of SEQ ID NO:2. e ) ammo acid residues 21 " 133 of SE Q 10 N0:2 ' 

5. An isolated polypeptide according to claim 1 compris- 0 amino acid residues 112-142 of SEQ ID NO:2; 
ing amino acid residue 1 to amino acid residue 142 of SEQ 40 g) amino acid residues 1-111 of SEQ ID NO:2; and 
ID NO:2. h) amino acid residues 1-133 of SEQ ID NO:2. 

6. An isolated polypeptide according to claim 1, 12. A fusion protein consisting of a first portion and a 
covalcntly linked amino terminally or carboxy terminally to second portion joined by a peptide bond, said first portion 
a moiety selected from the group consisting of affinity tags, comprising a polypeptide according to claim 1, and 
toxins, radionucleotides, enzymes and fluorophores. said second portion comprising another polypeptide. 

7. An isolated polypeptide comprising a sequence of 13. A polypeptide according to claim 1 in combination 
amino acid residues that is at least 80% identical to a amino with a pharmaceutically acceptable vehicle. 

acid residue 21 to amino acid residue 142 of SEQ ID NO:2, 

wherein said polypeptide specifically binds with an antibody * * * * * 



